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Preface 


This quarterly publication provides archival reports on developments in 
programs managed by JPL’s Office of Telecommunications and Data Acquisition 
(TDA). In space communications, radio navigation, radio science, and ground-based 
radio and radar astronomy, it reports on activities of the Deep Space Network (DSN) 
and its associated Ground Communications Facility (GCF) in planning, in support- 
ing research and technology, in implementation, and in operations. Also included 
is TDA-funded activity at JPL on data and information systems and reimbursable 
DSN work performed for other space agencies through NASA. The preceding work 
is all performed for NASA’s Office of Space Operations (OSO). The TDA Office 
also performs work funded by two other NASA program offices through and with 
the cooperation of the Office of Space Operations. These are the Orbital Debris 
Radar Program (with the Office of Space Station) and 21st Century Communication 
Studies (with the Office of Exploration). 

In the search for extraterrestrial intelligence (SETI), the TDA Progress Report 
reports on implementation and operations for searching the microwave spectrum. In 
solar system radar, it reports on the uses of the Goldstone Solar System Radar for 
scientific exploration of the planets, their rings and satellites, asteroids, and comets. 
In radio astronomy, the areas of support include spectroscopy, very long baseline 
interferometry, and astrometry. These three programs are performed for NASA’s 
Office of Space Science and Applications (OSSA), with support by the Office of 
Space Operations for the station support time. 

Finally, tasks funded under the JPL Director’s Discretionary Fund and the 
Caltech President’s Fund which involve the TDA Office are included. 

This and each succeeding issue of the TDA Progress Report will present mate- 
rial in some, but not necessarily all, of the following categories: 

OSO Tasks: 

DSN Advanced Systems 

Tracking and Ground-Based Navigation 
Communications, Spacecraft-Ground 
Station Control and System Technology 
Network Data Processing and Productivity 
DSN Systems Implementation 
Capabilities for Existing Projects 
Capabilities for New Projects 
New Initiatives 

Network Upgrade and Sustaining 
DSN Operations 

Network Operations and Operations Support 
Mission Interface and Support 
TDA Program Management and Analysis 
Communications Implementation and Operations 
Data and Information Systems 
Flight-Ground Advanced Engineering 

OSO Cooperative Tasks: 

Orbital Debris Radar Program 
21st Century Communication Studies 


iii 


OSSA Tasks: 

Search for Extraterrestrial Intelligence 
Goldstone Solar System Radar 
Radio Astronomy 


Discretionary Funded Tasks 
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Determination of Earth Orientation Using the 
Global Positioning System 

A. P. Freedman 

Tracking Systems and Applications Section 


Modern spacecraft tracking and navigation require highly accurate Earth-orien- 
tation parameters. For n ear- real- time applications, errors in these quantities and 
their extrapolated values are a significant error source. A globally distributed net- 
work of high-precision receivers observing the full Global Positioning System (GPS) 
configuration of 18 or more satellites may be an efficient and economical method 
for the rapid determination of short-term variations in Earth orientation. 

A covariance analysis utilizing the JPL Orbit Analysis and Simulation Software 
(OASIS) has been performed to evaluate the errors associated with GPS measure- 
ments of Earth orientation. These GPS measurements appear to be highly com- 
petitive with those from other techniques and can potentially yield frequent and 
reliable centimeter-level Earth-orientation information while simultaneously allow- 
ing the oversubscribed Deep Space Network (DSN) antennas to be used more for 
direct project support. 


I. Introduction 

Knowledge of the Earth's orientation in space is criti- 
cal to the operation of NASA's Deep Space Network 
(DSN). Unless the orientation is closely monitored, the 
variable rotation of the Earth can lead to errors in space- 
craft navigation. In near-re al-time, high-precision space- 
craft-tracking applications, the need for up-to-date Earth- 
orientation information is particularly crucial. The 
Magellan mission to Venus, for example, requires that 
Earth-rotation errors be kept under 30 cm; 1 by the mid- 
1990s, missions are envisioned that would require contin- 


1 T. F. Runge, “UTPM calibration accuracy for Magellan,” 
JPL IOM 335.5-87.81 (internal document), Jet Propulsion 

Laboratory, Pasadena, California, April 30, 1987. 


uously available Earth-orientation knowledge accurate to 
3 cm. 2 

The Navstar Global Positioning System (GPS) is a 
network of orbiting radio transmitters designed for naviga- 
tion purposes that is revolutionizing terrestrial distance de- 
terminations. The completion of the full satellite network 
by the early 1990s promises to extend recent improvements 
in regional geodetic measurements with GPS to a global 
scale [1, 2]. The capability of GPS to pinpoint receiver 
locations at the centimeter or subcentimeter level in a ter- 


2 R. Treuhaft and L. Wood, “Revisions in the differential VLBI 
error budget and applications for navigation in future mis- 
sions,” JPL IOM 335.4-601 (internal document), Jet Propul- 
sion Laboratory, Pasadena, California, December 31, 1986. 
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restrial reference frame that is precisely tied to an inertial 
frame suggests that GPS may be effective in monitoring 
Earth-orientation changes. 

Earth orientation consists of three parts: the angle 
of rotation of the Earth about its rotation axis relative to 
a mean rotation angle (UT1-UTC), the position of the 
current axis of rotation of the Earth with respect to a 
reference axis tied to the crust and mantle of the solid 
Earth (polar motion), and the orientation of the rotation 
axis in inertial space (precession and nutation). The first 
two of these parameters, UT1 and polar motion, collec- 
tively known as UTPM (Fig. 1), vary as a result of an- 
gular momentum exchange between the solid parts of the 
Earth and its atmosphere, oceans, and fluid core. These 
Earth-orientation components can vary rapidly and unpre- 
dictably. Nutations and precession are primarily products 
of Earth’s interactions with other celestial bodies and are 
largely periodic; they will not be dealt with in this article. 

Earth orientation 3 is currently being monitored by a 
number of precise geodetic techniques: Very Long Baseline 
Interferometry (VLBI), Satellite Laser Ranging (SLR), and 
Lunar Laser Ranging (LLR). These techniques can cur- 
rently achieve measurement accuracies of up to 2-3 cm 
over time scales as short as one hour [3]. The turnaround 
time necessary to collect and process these raw data can be 
quite long, however, VLBI from IRIS (International Ra- 
dio Interferometric Surveying) provides daily UT1-UTC 
and five-day UTPM data, while SLR from CSR (Center 
for Space Research, University of Texas, Austin) provides 
three-day polar- motion data. The results, however, are 
not usually available until a week or more after the epoch 
at which the measurements are valid. This delay is not 
acceptable for many DSN navigation needs. VLBI data 
obtained by JPL using the DSN antennas are known as 
TEMPO (Time and Earth Motion Precision Observations) 
and can be processed rapidly when necessary; demands for 
radio-telescope time limit the frequency of observations to 
weekly, however. Prompt reduction of data can be crit- 
ical to navigation since Earth orientation is continuously 


3 Earth orientation is measured in a variety of units. Polar mo- 
tion is essentially an angular displacement, while rotational 

variations can be expressed either as angular or temporal dis- 
placements. Both can also be expressed as a distance corre- 
sponding to the angular displacement measured at one Earth 
radius. Thus 1 cm at the Earth’s surface is equivalent to an 
angular distance of approximately 0.3 marcsec or 1.6 nrad. 
The time it takes for the Earth to rotate through this angle 
and move a point at the equator to the east by 1 cm is ap- 
proximately 0,02 msec of time. Thus 30 cm corresponds to 
0.65 msec of Earth rotation or 9.6 marcsec (47 nrad) of polar 
motion. 


changing. Earth rotation, in particular, is highly variable, 
and changes of 25 cm per day have been known to oc- 
cur. Thus no system presently active is likely to meet the 
long-term DSN needs of regular, high-precision, daily mon- 
itoring of Earth orientation with data reduction times of 
less than one day. 

VLBI, SLR, and LLR are, in addition, labor inten- 
sive and require significant investments in equipment and 
personnel. In VLBI, large radio telescopes are required — 
telescopes whose valuable observation time is in great de- 
mand at sites such as the DSN. The laser techniques use 
dedicated stations to obtain data but are subject to the 
vagaries of local weather conditions; thus they are not a 
reliable source for regular daily measurements. 

If GPS technology could produce Earth-orientation 
measurements of a quality comparable to that produced by 
VLBI, SLR, and LLR, it would free up significant amounts 
of time on the DSN and other VLBI networks currently 
being used to monitor Earth orientation. It would, more- 
over, allow measurements to be made more frequently than 
is now practical with VLBI. In addition, the use of radio- 
frequency energy would enable GPS systems to be much 
less sensitive to weather than optical systems. GPS would 
not replace these VLBI, SLR, and LLR techniques (due 
to systematic difficulties described below); rather, GPS 
systems would be employed in a synergistic combination 
with these present-day techniques to enhance overall per- 
formance. 

Frequent high-precision Earth-orientation data is also 
of value for scientific studies. Little is known about ex- 
changes of angular momentum between the solid Earth and 
the atmosphere, or about the excitation of polar motions, 
at periods of a week or less. Continuous GPS monitoring 
would help to extend our knowledge to higher frequencies, 
with benefit to geophysics, meteorology, and astronomy. 
Weather forecasting, in particular, may benefit from inde- 
pendent estimates of daily atmospheric angular momen- 
tum as provided by geodetic Earth-orientation measure- 
ments. 

A GPS receiver and data-processing system is sched- 
uled to be in place at each of the DSN sites within a year 
to enable highly accurate, ne sir-real- time ionospheric cal- 
ibration in support of deep-space missions that transmit 
at a single frequency. 4 This system is designed to have 


* C. J. Vegos, “DSCC Media Calibration Subsystem (DMD), 
Functional Design Review (Level D),” JPL 834-30, vol. 1 
(internal document), Jet Propulsion Laboratory, Pasadena, 
California, May 1, 1987. 
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a data turnaround capability of about 12 hours. These 
receivers are also expected to be part of an international, 
global GPS tracking network for the TOPEX/POSEIDON 
mission [4], By 1992, therefore, a GPS system should be 
in place to support the continuous monitoring of UTPM. 

This article documents a covariance analysis evalu- 
ating the potential of GPS for measuring Earth orienta- 
tion. The assumed satellite constellation is that originally 
proposed by the U.S. Department of Defense as the op- 
erational configuration. It consists of 18 satellites with 
12-hour orbits and lying in six orbit planes equidistantly 
placed in longitude, three satellites per orbit plane. This 
constellation enables at least five satellites to be seen and 
tracked most of the time from anywhere on the Earth’s 
surface [5]. Although the Air Force has subsequently mod- 
ified this constellation to include up to 24 satellites, these 
changes should not significantly alter the conclusions of 
this study. The network of ground receivers is assumed to 
consist of six sites regularly spaced around the globe (three 
of these coincident with the DSN sites). 

These sites are assumed to be equipped with receivers 
that yield two distinct observables: “carrier phase ” based 
on measurements of the radio frequency (RF) carrier that 
is transmitted by the GPS satellites, and “pseudorange,” 
based on a precise modulation of the transmitted signal. 
Both observables are indicators of the distance between a 
satellite and a receiver. Pseudorange consists of the light 
travel time between the two points, plus any clock offsets 
of the receiver and transmitter. It is often corrupted by 
multipath effects and is therefore the noisier data type. 
Carrier phase monitors the relative position change be- 
tween the satellite and ground station. It is a cleaner data 
set, but the absolute distance is made ambiguous by a con- 
stant bias (equal to an integer number of wavelengths) for 
each continuously measured satellite arc. 

Contemporary receiver capabilities are better for car- 
rier phase than for pseudorange, but the quality of pseudo- 
range data is rapidly improving. The GPS receiver being 
installed at the DSN sites can now achieve, under opti- 
mum conditions, pseudorange noise levels as low as 5 cm 
(averaged over 30 minutes), as well as carrier phase noise 
levels well below 0.5 cm [6]. By about 1992, when the full 
GPS constellation is active, such high-precision pseudor- 
ange and carrier phase data should be routinely available 
from numerous sites around the globe. 

II. Covariance Analysis 

This study utilized the Orbit Analysis and Simulation 
Software (OASIS) program, developed at JPL for the co- 


variance and simulation analysis of Earth-orbiting satel- 
lites [7]. It consists of a number of independent mod- 
ules: PV integrates the satellite orbits and computes the 
variational partial derivatives for satellite-related param- 
eters; REGRES-PMOD generates simulated observations 
and their measurement partials; OAFILTER does the ac- 
tual covariance analysis, i.e., using specified uncertainties 
of the data and the models, the program estimates the un- 
certainties in desired parameters; and U DIGEST generates 
the desired output. Parameters can be either estimated 
(adjusted) or “considered”; “considered” parameters axe 
treated as systematic error sources [8]. Adjusted param- 
eters can be modeled either as constants or as stochastic 
variables. 

Parameters that can be adjusted include 

(1) satellite epoch states, i.e., their initial positions 
and velocities 

(2) various satellite force-model parameters such as 
solar- radiation pressure and Y-bias 

(3) satellite and station clock offsets 

(4) station locations 

(5) wet-zenith tropospheric path delay for each sta- 
tion 

(6) Earth-orientation parameters 

(7) gravitational harmonic coefficients and the value 
of GM (the gravitational constant, G, multiplied 
by the mass of the Earth, M) 

(8) geocenter offset 

(9) carrier phase biases 

To determine Earth orientation with GPS, one needs 
to know the precise position and orientation of a set of 
points on the Earth’s surface with respect to an inertial 
reference frame as a function of time. The GPS satellites 
provide an orbital reference frame that is not truly inertial 
but is slowly varying with time. Uncertainties in the GPS 
orbits can be reduced through estimation of parameters 
(1), (2), and (7) described above. The distances between 
satellites and receivers can be determined from the data 
after removing the effects of parameters (3), (4), (5), and 
(9). Determining the orientation of the satellite constel- 
lation in inertial space may be achieved by fixing the lo- 
cations of a few ground receivers. These sites are known 
as fiducial sites and are tied by local ground surveys to 
nearby, colocated VLBI antennas whose relative positions 
in inertial space are known precisely [9]. The origin of this 
VLBI frame may not be coincident with the Earth’s center 
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of mass as determined by the satellites; this geocenter off- 
set (parameter 8) can also be estimated. Thus the satellite 
framework can be constrained in inertial space, and the 
movements of the solid Earth within that framework, such 
as Earth orientation, can be observed. 

Appropriate a priori uncertainties for all estimated pa- 
rameters are needed to strengthen the solution. Deciding 
which parameters to estimate or consider, what a priori 
values to use, and which data to include depends on the 
physical problem of interest. 

The following questions were addressed in this study: 

(a) How many GPS measurements are necessary to 
generate Earth-orientation values with a preci- 
sion comparable to present techniques? In other 
words, how long need the observation periods be 
in order to produce useful Earth-orientation 
data? 

(b) Are both pseudorange and carrier phase data 
types needed, and what maximum data noise is 
permissible for each type to allow adequate reso- 
lution of Earth-orientation parameters? 

(c) flow important are the effects of solar-radiation 
pressure, station location errors, tropospheric un- 
certainties, and geocenter errors on Earth-orien- 
tation estimation? Do these parameters need to 
be estimated along with satellite states and 
Earth-orientation parameters, or can they be con- 
sidered? 

The parameter-estimation strategy is listed in Table 1. 
Earth-orientation parameters and their rates of change are 
all estimated with a priori uncertainties at least as large as 
the uncertainties expected in VLBI-provided UTPM prior 
to a GPS measurement, and at least a factor of 10 larger 
than the final, desired uncertainties. Satellite states and 
solar-radiation parameters are also estimated, with a pri- 
ori sigmas comparable to the known uncertainties of the 
broadcast satellite ephemerides and of solar-radiation ef- 
fects. Both DSN and non-DSN station locations are esti- 
mated, with the three DSN sites comprising the fiducial 
network constrained more tightly than the non-DSN sites. 
A priori sigmas for the station locations correspond to 
the present-day uncertainties of the VLBI baselines. The 
wet-zenith troposphere delay is estimated with an a priori 
sigma that is appropriate for a dry climate if surface mete- 
orology data are available, and is more than adequate for 
wetter regions if water vapor radiometers are used. 

The geocenter, carrier phase biases, and clock errors 
were all estimated with large, effectively unconstrained 


a priori values. One station clock was fixed as a reference 
clock. All parameters were estimated as constants over the 
observing period, except for the clock errors, which were 
modeled as white noise [10]. 

The assumed data noise levels are appropriate for a 
DSN GPS receiver operating either in a poor, noisy propa- 
gation environment (20-cm pseudorange, 1-cm carrier 
phase) or in a reasonably good environment (5 cm, 0.5 cm, 
respectively). Note that these data noise levels assume in- 
dividual measurements averaged over a 5-minute interval, 
or “batch ” 

Details of the software capabilities and modeling 
strategies for most parameters are described in [10]. Six 
matrix rotations 5 are applied to station-location vectors to 
convert them from an Earth-fixed reference frame (1903.0 
CIO frame) to a geocentric inertial system (J2000). Three 
of these rotations correspond to UT1 and the two compo- 
nents of polar motion. For this study, Earth-orientation 
rates are also needed. The UTPM components are thus 
modeled as 0(£) = a 4- b(t — to), where 9{t) is a UTPM pa- 
rameter residual, a is the estimated value of the constant 
component at some epoch t o, and b is the estimate of the 
rate component. 

III. Results and Discussion 

Presented below are the results of covariance analy- 
ses for three distinct models summarized in Table 2. In 
model A, only pseudorange is employed, and these data 
have a high noise level. This model thus represents a 
“worst case” scenario for determining Earth orientation. 
Model B is identical to A, except that higher quality pseu- 
dorange data are assumed. Model C represents a near- 
optimal situation with regard to data quality: Both high- 
quality pseudorange (5 cm) and carrier phase (0.5 cm) are 
employed jointly. Note that this “best-case” model repre- 
sents the expected quality of the data. In all of these mod- 
els, station locations, wet-zenith tropospheric delays, solar 
pressure, clock offsets, geocenter offset, and carrier phase 
biases are assumed to be estimated along with Earth-orien- 
tation parameters and satellite epoch states. 

The Earth-rotation parameter, UT1-UTC, is not ex- 
pected to be directly measurable by GPS. An error in the 
satellite-node longitudes cannot unambiguously be sepa- 


5 W. I. Bertiger, “Non-force models module,” OASIS Math - 
ematical Description, V. 1.0 , JPL D-3139 (internal docu- 
ment), Jet Propulsion Laboratory, Pasadena, California, 
April 1986. 


4 



rated from uncertainty in UTL This is a problem com- 
mon to all satellite geodesy, including SLR. If, however, 
GPS can determine the change over time of UT1-UTC, 
then this change can be combined with an initial value 
from VLBI measurements to yield the full UT1-UTC as a 
function of time. 

Figure 2 illustrates the ability of GPS to monitor the 
rate of change of UT1-UTC, also known as length-of-day 
(LOD). With eight hours of observation, none of the mod- 
els has achieved the measurement precision of present-day 
techniques. This present-day capability, available from 
YLBI or SLR after a processing delay of a week or longer, is 
indicated on Figs. 2, 4, and 5 by an arrow. With 16 hours of 
observation, however, model C, with its high-quality pseu- 
dorange and carrier phase data, can resolve LOD to better 
than 5 cm. By 24 hours, both models B and C show error 
levels comparable to or better than current uncertainties. 
The best scenario, model C, predicts subcentimeter LOD 
accuracy with 24 hours of GPS tracking. 

This powerful ability to measure the rate of change of 
UT1-UTC enables the estimation of UT1-UTC with the 
help of VLBI. Because GPS and VLBI receivers are colo- 
cated at DSN sites, the VLBI and GPS reference frames 
should be precisely defined with respect to each other, 
yielding a high-quality GPS tie to inertial space. By inte- 
grating LOD over time, the total change in UTl-XJTC can 
be estimated and added to an initial value determined from 
VLBI. If UT1-UTC can be measured accurately to 2 cm 
at an initial epoch, for example, daily estimates of LOD 
with GPS will add less than one centimeter per day to this 
uncertainty. To remain within 30 centimeters of the true 
UT1-UTC, VLBI measurements of UT1-UTC may only 
be needed monthly. This is illustrated in Fig. 3. Combin- 
ing and smoothing all the data from various sources will 
yield better UT1 estimates, but these will only be avail- 
able much later, after additional data are obtained. Thus 
GPS and VLBI are complementary techniques that can be 
combined synergistically to yield improved UT1 estimates. 

Figure 4 illustrates the improvement with observing 
time of the Y component of polar motion (PMY). The 
X component (PMX) behaves in a similar manner and is 
not illustrated here. By 16 hours, the “best case” sce- 
nario of model C shows measurement accuracy comparable 
to present techniques, while by 24 hours model B (high- 
quality pseudorange) also yields an acceptable predicted 
error. The error estimates for all three models seem to 
be converging at the few-centimeter level. This is a lim- 
itation controlled mainly by the a priori station location 
uncertainties (constrained at 3-5 centimeters for each of 
three components). 


Figure 5 shows the estimate of the error in the rate of 
change of PMY as a function of observing time; the rate 
of change of PMX error exhibits similar temporal varia- 
tions. The behavior illustrated here is similar to that of 
the LOD component (Fig. 2). All three models show rapid 
improvement with time; by 24 hours, models B and C 
again achieve high-quality rate measurements. In the best 
case (model C), the error is down at the few- mm/d ay level. 

It appears that GPS determination of Earth orienta- 
tion is feasible for LOD, polar motion, and polar-motion 
rates. With less than 24 hours of tracking, these compo- 
nents will be as well-determined as measurements by cur- 
rent techniques. Recall, however, that current techniques 
only provide these values many days after the measure- 
ments are taken, whereas GPS is expected to deliver the 
results to the DSN within 12 hours. The best estimates 
come, as expected, with the smallest data noise. Com- 
bining high-quality pseudorange and carrier phase data 
reduces the uncertainties in Earth-orientation parameters 
most rapidly, achieving present-day measurement precision 
within approximately 16 hours. High-quality pseudorange 
data alone (model B) is the next-best option, measuring 
UTPM to an acceptable level within 24 hours. 

Do Earth-orientation data continue to improve if ob- 
servations are extended beyond 24 hours? And can UTPM 
be estimated every day with consistently high accuracy? 
To answer these questions, parameters that are not ex- 
pected to change with time, such as station locations and 
GPS orbit epoch states, should be modeled as constant 
over the entire observing period. Earth-orientation pa- 
rameters, which do change with time and for which one 
hopes to observe the most rapid variations possible, need 
to be periodically “reset,” i.e., their uncertainties inflated 
at regular intervals during the observing session while sub- 
sequent data attempt to constrain their then-current val- 
ues. 

Figure 6 illustrates this situation. Data are taken for 
48 hours, with all parameters except Earth orientation 
and clock errors modeled as constant over this time pe- 
riod, Just after 24 hours, the uncertainties of the Earth- 
orientation parameters are reset at their a priori values 
(while clock errors continue to be modeled as white noise). 
Since additional data need only constrain Earth orienta- 
tion, these parameters improve much more rapidly than in 
the first 24 hours. For comparison, Earth-orientation pa- 
rameters estimated as constant over the entire 48 hours are 
also shown. This figure is produced with model B, using 
high-quality pseudorange only; performance is significantly 
enhanced if carrier phase data are included. 
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At 24 hours, PMY error is estimated to be 3 cm. 
Twelve hours after resetting (i.e., at 36 hours) the error is 
already down to 6 cm (versus > 10 cm in the first 12 hours); 
by 48 hours, the error is again at 3 cm. This compares with 
an error of 2 cm if PMY were estimated as constant over 
the full 48 hours. Although the 48-hour estimate after re- 
setting is no better than the 24-hour estimate, the error 
drops much more rapidly in the second 24 hours than dur- 
ing the first. Thus polar-motion measurements every 12 
to 16 hours may be possible with this technique, i.e., after 
other parameters have been suitably constrained during 
the first 24 hours. 

UT1-UTC and PMY rates show even more pro- 
nounced improvement. LOD at 24 hours shows an error 
of about 2 cm; continuing to 48 hours brings this down 
to < 0.5 cm, while resetting at 24 hours yields 3 cm at 
36 hours and 1 cm at 48 hours. The polar- motion rate im- 
proves from 5 cm/day at 24 hours to < 0.5 cm/day after 
48 hours, and to < 1.5 cm/day if reset after 24 hours. 

All these values are comparable to or better than 
present-day measurement capabilities whose time resolu- 
tions are 24 hours or more. Thus long arcs of GPS data 
show promise for frequent high-quality Earth-orientation 
measurements. Future studies will model Earth orien- 
tation stochastically, treating each UTPM parameter as 
a random walk whose standard deviation is allowed to 
grow in a manner consistent with the empirical behavior 
of UTPM. 

To see whether station locations, solar-radiation pres- 
sure, geocenter location, and wet-zenith troposphere delay 
are all truly necessary to be adjusted along with satellite 
orbits and Earth orientation, a number of covariance runs 
were performed in which these parameters were consid- 
ered. One of these is shown in Fig. 7. In this model, 20-cm 
pseudorange and 1-cm carrier phase were assumed. Solar 
pressure and geocenter position were adjusted along with 
satellite epoch states and Earth orientation, while station 
locations and wet-zenith troposphere delays were consid- 
ered. All the Earth-orientation parameters and their rates 
show a high sensitivity to the considered parameters, as the 
formal error (labelled “Data”) is dwarfed by the uncertain- 
ties due to considered effects. The station location errors, 
considered at 5 cm for each component, dominate the to- 
tal error (“Total RSS sigma”), while the wet troposphere 
delay, considered at 1 cm, has a smaller but significant 
effect. 


These results suggest that, with 24 hours of data, sta- 
tion locations and wet-zenith troposphere delays do need to 
be adjusted in the estimation in order to generate reliable 
UTPM estimates (as has, in fact, been done). Additional 
consider runs (not shown) demonstrate that solar pressure 
is a significant error source even at 12 hours and needs 
to be adjusted, whereas the geocenter location does not 
seem to have a significant effect on Earth orientation at 
an uncertainty level of 10 cm. Since consider errors scale 
linearly with the uncertainty in the considered parameter, 
if station location s are known to better than 1 cm in each 
component, their effects will not be significant at 24 hours. 
Similarly, it is not necessary to adjust the wet-zenith tro- 
posphere delay if it is known to better than 1 cm. 

IV. Conclusions 

This covariance analysis demonstrates that 

(1) High-quality length-of-day and polar-motion data 
can be obtained with GPS in under 24 hours, with 
a precision comparable to or greater than present- 
day techniques. 

(2) UT1-UTC may be reliably determined if periodic 
reference frame ties are performed, and initial 
values of UT1-UTC measured, with VLBI using 
colocated receivers. 

(3) The best combination of data types is high- 
quality pseudorange plus carrier phase, although 
high-quality pseudorange (data noise < 5 cm) 
alone performs well. 

(4) It is necessary at present to adjust solar-radiation 
pressure and Y-bias, station locations, and wet- 
zenith troposphere delay in addition to satellite 
states and Earth-orientation parameters. How- 
ever, if the a priori uncertainties in the station po- 
sitions or troposphere can be reduced, they need 
not be included in the estimation process. 

In conclusion, it appears that GPS will be a useful ad- 
dition to the collection of techniques currently employed to 
measure Earth orientation, and it may provide a reliable, 
economical method of monitoring in near-real-time high- 
frequency variations in UTPM. For the DSN, GPS meth- 
ods may considerably reduce the demand for antenna time 
needed to measure Earth-orientation parameters while si- 
multaneously enhancing the parameters’ accuracy. 
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Table 1. Parameter-estimation strategy (a priori sigmas) 


Earth-orientation parameters 


UTl-UTC rate 
PMX, PMY 
PMX, PMY rates 


10 -7 (~ 10 msec/day) 

80 nrad (~ 16 marcsec) 

10 -11 rad/sec (~ 200 marcsec/day) 


Satellite parameters (18-satellite constellation) 


X, Y, Z positions 10 m (each component) 

X, Y, Z velocities 1 mm/sec (each component) 

Solar-radiation pressure (X, Z)* 50 percent 

Y-Bias* 10~ 12 km/sec 2 (100 percent) 


Station parameters (6 stations with global distribution) 


DSN station locations* 
Non-DSN station locations* 

Wet-zenith troposphere delay* 


3 cm (each component) 

5 cm (each component) 
(both 5 cm, if considered) 
10 cm 

(1 cm, if considered) 


Other parameters 


Geocenter* 

Carrier phase biases 
Satellite and station clocks 
(modeled as white noise) 


100 m (each component) 
(10 cm, if considered) 

10 km 

1 km (except one station) 


Data noise 


Pseudorange 20 cm, 5 cm (5-min batches) 

Carrier phase 1 cm, 0,5 cm (5-min batches) 


* These parameters were considered in the covariance analyses dis- 
cussed in the text, but are estimated in the models shown in Table 2. 


Table 2. Models 


Model 


Data 


A (“worst case model”) 
B 

C (“best case model”) 


20- cm pseudorange only 
5- cm pseudorange only 

5-cm pseudorange 
0.5- cm carrier phase 



CRUST-FIXED AXIS 




Fig. 1. Schematic illustration of the components of Earth 
orientation: polar motion (PMX, PMY) and UT1-UTC. 


Fig. 3. Illustration of how precise periodic VLBI measurements of 
UT1-UTC can be combined with dally GPS LOD measurements to 
constrain the uncertainty in UT1-UTC. 




16 24 

LENGTH OF OBSERVATION, hr 


Fig. 2. Predicted uncertainty In the rate of change of UT1-UTC 
as a function of observation time. The arrow at right Indicates 
present-day measurement capability in this and subsequent fig- 
ures. Length-of-day uncertainty can also be found from this figure, 
as LOD = (UT1-UTC rate) x 1 day. The units of LOD are cm. 
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Fig. 4. Predicted uncertainty in Y polar motion (PMY) as a func- 
tion of observing time. X polar motion (PMX) behaves In a similar 
manner. 
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LENGTH OF OBSERVATION, hr 

Fig. 5. Predicted uncertainty in the rate of change of Y polar motion 
as a function of observing time. The PMX rate behaves similarly. 




Fig. 6. Predicted improvement in estimates of UTPM after two days. Two cases are shown In which Earth-orientation parameters (o) 
are modeled as constant over 48 hours and in which these parameters (A) are reset to large values after 24 hours. Model B is used, 
employing only high-quality pseudorange data. 
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X POLAR MOTION Y POLAR MOTION UT1-UTC RATE X POLAR RATE Y POLAR RATE 


Fig. 7. Consider errors at 24 hours. Station locations and wet-zenith troposphere delay were considered with uncertainties of 5 cm per 
component and 1 cm, respectively. In this covariance analysis, satellite epoch states, Earth orientation, solar pressure, clocks, carrier 
phase biases, and the geocenter were ail adjusted. The assumed data noise was 20 cm pseudorange and 1 cm carrier phase. The “Total 
RSS sigma” denotes the root sum square of the adjusted error estimate (“Data”) and the two considered errors. 
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A simple analytical model has been developed for the electric and ponderomo- 
tive (trapping) potentials in linear ion traps. This model was used to calculate the 
required voltage drive to a mercury trap, and the result compares well with exper- 
iments. The model gives a detailed picture of the geometric si ape of the trapping 
potential and allows an accurate calculation of the well depth . The simplicity of 
the model allowed an investigation of related, more exotic trap designs which may 
have advantages in light-collection efficiency. 


I. Introduction 

Radio frequency (RF) quadrupole ion traps have great 
importance to the development of new atomic frequency 
standards and high-precision measurements. The three- 
dimensional quadrupole trap has mainly been used. The 
ideal geometry for the electrodes of this trap is hyper- 
boloids of revolution, which produce a pure quadrupole 
field. Practical traps are built with a different, more open 
geometry because of the need to collect light efficiently 
from the ions. Ion fluorescence is used to determine its 
quantum state, and the collection efficiency is an important 
factor in the signal-to-noise ratio. Deviations in the elec- 
tric field and trapping potential from a pure quadrupole 
are usually not considered in detail due to the difficulty in 
calculating them. 

In order to increase the number of trapped ions with- 
out degrading the frequency stability, the Time and Fre- 
quency Systems Research Group recently introduced into 
frequency-standard research a linear trap based on the 
quadrupole mass spectrometer [1]. A side benefit of this 


trap geometry is the ability to calculate the electric and 
trapping potentials to good accuracy with a simple an- 
alytic model. In this article, the model is developed and 
used to predict an ion resonance frequency, which has been 
experimentally measured. The article then shows how the 
model can also be used to investigate similar but more 
exotic geometries which may be advantageous in some ap- 
plications. 

The linear trap consists of four parallel cylindrical 
rods arranged with their centers on the corners of a square. 
An RF voltage is applied to the rods so that nearest neigh- 
bors have opposite polarity. This creates an alternating 
two-dimensional quadrupole electric field between the rods. 
The field confines ions along the center axis of the trap 
by the ponderomotive force, just as in a quadrupole mass 
spectrometer. Two endcap electrodes with a DC bias volt- 
age applied confine the ions axially. This article considers 
only the center axial region, far enough from the endcaps 
so that the field is essentially two-dimensional. The DC 
fields from the endcaps decay exponentially along the axis, 
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so that this restriction applies to the vast majority of the 
trapping volume. 

The electric fields due to four cylindrical rods have 
been studied for years by designers of mass spectrometers 
and particle accelerators. In order to achieve the closest 
approximation to a true quadrupole field, it has been found 
that the ratio of the rod radius to the distance between the 
rod center and the trap axis reaches its optimum value at 
0.5342. This value was first determined by measurements 
on a quadrupole accelerator magnet [2] and later repro- 
duced with numerical calculations [3]. 

A radius-spacing ratio of 0.5342 produces a geometry 
that is too closed to allow efficient light collection from 
ions. A pure quadrupole field is not really necessary if ion 
confinement is the only goal; hence, in this case the ra- 
tio can be reduced. The trap discussed here has a ratio 
of 0.25, and even smaller ratios might be desirable. The 
smallness of the ratio leads directly to a simple approxima- 
tion, namely, the fields produced by infinitely thin rods. A 
conducting rod with an applied voltage has some induced 
charge, and as the rod diameter is reduced, all the charge 
coalesces into a line. This idea can be used to transform a 
two-dimensional boundary- value problem into a much sim- 
pler calculation of the potential produced by fixed sources. 
All the calculations that follow are based on fields gener- 
ated by an array of uniform parallel line charges. 


II. Model for a Four-Rod Trap 


Changing to polar coordinates, it is found that 


|F| 2 = 


16r 2 

r 8 — 2 r 4 cos 40 4 


— = TG(r, 0) 


( 1 ) 


where T = 16. One can expand G about the origin to 
obtain 

G(r, 0) « r 2 4 2 r 4 cos 40 4 r 8 ( 1 4 2 cos 80) H J 


where the leading term r 2 is an isotropic harmonic po- 
tential, and the higher-order terms have at least fourfold 
symmetry. It turns out that any two-dimensional field con- 
figuration that vanishes at a point produces an isotropic 
harmonic trapping potential in lowest order about that 
point. In the trap configurations analyzed later, the func- 
tion G is always defined to have a leading term r 2 , and T 
is used for the numerical factor. 

The trapping potential model function G(x,y) is 
shown in a three-dimensional plot in Fig. 1, and shows a 
center well rising up to singularities at the rods. Halfway 
between the rods are saddle points beyond which the po- 
tential drops again. The height of the saddle points sets the 
maximum energy an ion can have and still stay trapped, 
i.e., the well depth. The well depth can be calculated by 
setting 0 = 7r/4 in Eq. (1) and finding the maximum value 
of G(r, 7r/4). The maximum occurs at r s — 3" 1 / 4 = 0.760, 
and has a value of G s = 9/16\/3 = 0.3248, where the 
subscript s is the saddle point. 


Since the model field is two-dimensional, the method 
of complex variables can be used. This is not necessary, 
but makes the calculation of the trapping potentials a little 
easier. Adopting the notation of Landau and Lifshitz [4], a 
positive line charge at z = zq produces a complex potential 
w = — lo g(z — zq ) 1 whose real part is the ordinary electric 
potential. Dimensionless quantities are now used to cal- 
culate the geometric form of the potentials. Scale factors 
are introduced later to calculate real trap parameters. The 
model for the quadrupole trap consists of two negative line 
charges at z — 4 i and two positive line charges at z = ±1. 
This produces the complex potential 



Now that the electric and trapping potentials pro- 
duced by four line charges have been calculated, a real trap 
may be modeled. The electric equipotentials of the model 
are plotted as contours in Fig. 2. Close to the charges, 
they have a nearly circular shape. The equipotentials of 
a real trap are exactly circular at the electrode surfaces. 
The deviation of the modeFs equipotential from circularity 
is calculated by finding its horizontal and vertical “diam- 
eters” and taking the difference. The electric potential V 
is the real part of w, and is given by 

2 v _ x 4 + y 4 + 2(x 2 - y 2 4- x 2 y 2 ) + 1 _ „ 2 f2) 

C x 4 + y 4 + 2(y 2 — x 2 -f x 2 y 2 ) + 1 


The ponderomotive trapping potential is proportional to 
the square of the electric field F [5]. To calculate this, note 
that —F x 4 iF y = dw/dz , so that 


The equipotential surrounding the charge at z = 1 is to be 
examined here, so first the intersection of the equipotential 
with the x-axis is found by using 


\F\ 2 = 


dw 2 16|z| 2 

U ~ |z 4 -l| 2 


|1 + * 2 1 

|1 - x 2 | 


= D 


( 3 ) 
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For a fixed value of V, Eq. (3) has two positive and two 
negative solutions. If the smallest positive solution is de- 
noted zo, and the other positive solution is denoted x i, 
then Eq. (3) produces the following relationships; 


1 = 

x 0 V B - 1 


(4a) 


d = X\ — xq 



2 

VB S - 1 


(4b) 


XQ 4- Xi _ 1 -f xl __ B 
2 - x 0 ~ y/B 2 - 1 


(4c) 


which are valid for B > 1. Notice that there is a symme- 
try between B and 1/B corresponding to the symmetry 
between V and —V. The case of B < 1 applies to the 
region near the charges at z — ±i. 

Point 2{> corresponds to the inner surface of a rod, X\ 
to the outer surface of a rod, d to the horizontal diameter of 
the rod, and c to the center of the rod. The “rod” defined 
here is a hypothetical electrode shaped so that four of them 
will produce a field distribution equivalent to the four line 
charges. To calculate the vertical diameter, one can use 
the intercepts ±y 0 on the line x = c. Using Eq. (2) results 
in 


will have their boundaries at greater distances than 1.65. 
Therefore, boundaries will be ignored in this treatment. 

In order to use the model to calculate the parameters 
of a trap, the “squashed” rods of the model are associated 
with the round electrodes of the trap, and the linear di- 
mensions are scaled appropriately. Dimensions of the real 
trap are given in capital letters; model dimensions are in 
lower case. The distance of the line charges from the trap 
axis is A, and they will not be exactly at the center of the 
electrodes. The inner and outer x-axis intercepts of the 
trap rods are called Xq and X\ , respectively. Therefore, 
the dimensionless coordinates are given by 


X 0 

X0= - 

(5a) 

X! 

Xl = ~ 

(5b) 

A 2 = X 0 X x 

(5c) 


The mercury ion linear trap has dimensions Xq — 7.62 mm 
and = 12.7 mm, implying A — 9.84 mm (compared to 
10.2 mm for the true center), xo = 0.774, x\ — 1.29, and 
B = 3.99. The horizontal and vertical diameters of the 
equivalent model trap rod differ by 3 percent, providing a 
good approximation to a circle. 


2 B (v^B 2 + 2) - B) - 1 ! 


The approximation is very good for B > 2, and improves as 
B increases. The fractional difference of the rod diameters, 
(d — 2yo)/d, scales as l/(27? 2 ), and is consistent with the 
idea that the equipotentials become more circular as the 
line charge is approached. A curious feature of the model is 
evident in Eq. (4c). The center of the rod c depends on B 
and does not coincide with the line charge at x = 1. Seen 
another way, the line charge sits at the geometric mean of 
the inner and outer surfaces of the rod (xq and xi) and 
only approaches the arithmetic mean as B becomes large. 

So far, it has been assumed that the boundary is in- 
finitely far away. This is allowed since the potential V in 
Eq. (2) tends toward zero for large r. The fact that the to- 
tal charge is zero assures this. Denison [3] has numerically 
calculated the effect of a circular grounded boundary at 
r = 1.65 for an optimized quadrupoie with a radius-spacing 
ratio of 0.5342. He found that the boundary changed the 
optimum ratio by about 1 percent. Although a more open 
trap may be more sensitive to a boundary, practical traps 


In order to test the predictive power of the model, the 
applied voltage needed for the mercury ion trap is calcul- 
cated to produce a natural ion resonance frequency u) of 
2 tt x 48.5 kHz. For small displacements, the leading r 2 
term in the trapping potential leads to harmonic motion. 
The ions are detected by amplifying the current they in- 
duce on the trap rods at the natural frequency, and the 
amplifiers are tuned to 48.5 kHz. The driving voltage is 
applied at a frequency Q of 27T x 500 kHz in a balanced 
mode, so that the peak voltage on two opposing rods is Vq 
and the voltage on the other two rods is — V 0 with respect 
to the vacuum system. The balanced drive keeps the trap 
axis at zero potential, which allows the application of only 
DC bias to the endcaps. The amplitude of the applied RF 
voltage is ramped until a resonance signal appears, indi- 
cating that the natural frequency of the trap matches the 
frequency to which the amplifiers are tuned. 

Calculating the required applied voltage uses the fact 
that on the inner surface of the rod xo, the electric poten- 
tial V equals the applied voltage Vq , so that 

V _ In |(1 + x 2 )/( 1 - x 2 )[ _ In [(1 -h x 2 )/(l - x 2 )| 

Vo - ln|(l + ^)/(l-^)l ~ ln5 
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and the electric field is 


and have the value G 8 — 1/4. The electric potential can 
be written as 


\E \ 2 = 


r YlGf^y) 

(Ain BY 


e 2V = x 4 + y 4 + 2(x 2 — y 2 + x 2 y 2 ) + 1 = 5 2 (9) 


The ponderomotive trapping potential ^ is given by 


e 2 E 2 Te 2 V 2 G(x,y) 
4mfi 2 4mf2 2 (Alni?) 2 


One can calculate the intercepts of an equipotential on the 
y-axis around the charge at z — i and obtain relations 
analogous to those of the four-rod trap: 


where m is the mass of the ion [5]. Using the harmonic 
approximation for G leads to an expression for the natural 
resonance frequency 


y/2 mQ 2 A 2 In B 


( 7 ) 


For mercury isotope 199 and the trap parameters given 
above, Eq. (7) predicts a peak drive voltage of 93.7 V. The 
experimental value is 100 ± 5 V, giving agreement almost 
within experimental error. The well depth can also be 
calculated using Eqs. (6) and (7) as 

». = 2 ^^ ( 8 ) 


yielding a value of 3.00 eV for the parameters here. 


III. Model for a Two-Rod Trap 

The simplicity of this model makes it useful for ana- 
lyzing other two-dimensional trap geometries. Any system 
that produces a point with a vanishing electric field can 
potentially trap ions. The simplest system consists of two 
rods driven by the same voltage with respect to a dis- 
tant boundary. This configuration has a zero field point 
midway between the two rods. A trap of this type was 
demonstrated with oil droplets by Straubel [6] very early 
on, and may be of use in frequency standards due to its 
wider viewing angle. The model for this trap is two equal 
negative line charges at z = ±i. The complex potential is 

w = ln(z 2 + 1) 

and the geometric form of the ponderomotive potential is 

r 2 

G ( r ’ ^ ~ P + 2r 2 cos 2 4 > + 1 

with T = 4. Contours of the electric equipotentials and a 
plot of the ponderomotive potential are shown in Figs. 3 
and 4. The two saddle points are on the z-axis at r s — 1, 


I/O = V 1 — B 

(10a) 

yi = Vl + B 

(10b) 


for B < 1. Combining Eqs. (10a) and (10b) to calculate 
the scaling law for a real trap results in 2 A 2 = Y 2 + Y 2 , 
where Yq and Y\ are the inner and outer surfaces of the 
rod. For a trap with rods the same size and spacing as 
two opposing rods of the mercury ion trap, A = 10.5 mm. 
This time the line charge is more distant than the rod 
center (10.2 mm). A calculation of the diameter difference 
gives a deviation from roundness of 3 percent. 

In order to calculate the applied voltage, the surround- 
ing boundary must be taken into account, because the bars 
must be driven with respect to something. According to 
Eq. (9) and Fig. 3, the equipotentials for large r become 
roughly circular with a value of 21nr, and it is assumed 
that the boundary follows one of these contours. The po- 
tential on the trap axis is zero and the potential on the 
rod is Ini?. The applied voltage will be proportional to 
2 In r — In f?, where r = R/A and R is the radius of the 
surrounding boundary. The trapping potential can then 
be written as 


^ 4mQ 2 A 2 (2 In r — In B ) 2 

The dimensions of the vacuum system correspond to a 
value of 2.5 for r. In order to duplicate the natural res- 
onance of 48.5 kHz with the tworod trap, a peak drive 
voltage of 364 V is necessary, an increase by a factor of 3.9 
over the four-rod trap. The well depth is still governed by 
Eq. (8), and depends on the boundary only through the 
voltage necessary to maintain d. The well depth then is 
2.63 eY. The price paid for an increased viewing angle is 
a larger drive voltage and a slightly smaller well depth. 

An added complication with this trap is that the elec- 
tric potential at the trap axis is now oscillating with respect 
to the vacuum system at the drive frequency. This makes 
an added AC bias to the endcaps necessary to keep them 
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at a constant potential above the trap axis. Straubel did and for the outer rods (B 2 < 1) 
not use endcaps, but relied upon fringing fields from the 

ends of his rods for axial confinement. d 2 = B 2 


IV. Models for a Three-Rod Trap 

Although the viewing angle is larger in the two-rod 
trap, the trapping point is still between the two rods. It 
may be desirable for some applications to trap well out- 
side the electrode structure. This becomes possible with a 
three-rod trap. A simple example of a three-rod trap model 
consists of a positive line charge at the origin flanked by 
two equal negative line charges on the real axis at z = ±1. 
The resulting complex and ponderomotive potentials are: 




Once again, the intercepts of the outer rod obey the rela- 
tions of Eq. (5). The boundary is again assumed to con- 
form to a long-range equipotential at dimensionless radius 
r with value lnr. All the potentials can be offset by In r 
to keep the boundary at ground and obtain the following 
form for the applied voltages: 


Vi = HBi/r) 
V 2 In (B 2 /r) 


and 


2 f4 + 2r 2 cos 20 + 1 
r 2 (r 4 _ 2 r 2 cos 20 + 1) 


( 11 ) 


The electric field vanishes at z = ±i, and these are the 
trapping points. The electric and trapping potentials are 
shown in Figs. 5 and 6. 


If Eq. (11) is expanded about the trapping point 2 = 
then T = 4. The expression for G is too complicated to 
be very useful. The lowest saddle point can be found from 
Eq. (11) at r 9 — \flL + \/5 = 2.058, and 0 = n/2 where 
G s = 0.0217. This gives a well depth of 200 meV for the 
parameters used here. The question of the applied voltage 
is slightly more complicated than in the previous two cases. 
Since the total charge is not zero, the boundary must again 
be included in the calculation. There are now three volt- 
ages involved: the central rod Vi, the outer rods V 2} and 
the boundary V3. The ratios between these voltages can 
be calculated from the expression for the potential 


Using a value of 2.5 for r and the same rod diameters and 
spacing as before obtains peak applied voltages of 49.4 V 
and —214 V for V\ and V 2 . A suitably tapped transformer 
could be used to provide these drive voltages. Alterna- 
tively, the ratio of the inner and outer rod diameters could 
be adjusted to force B\B 2 = r 2 so that Vi = — V 2 . Even 
with this simplification, an additional AC voltage is nec- 
essary to bias the endcaps as in the two-rod case. 

The influence of the boundary can be removed by 
choosing the total charge to be zero. This can be ac- 
complished by doubling the center charge. Unfortunately, 
there is no trapping point with this configuration while the 
charges are in a line. Moving the center charge down along 
the imaginary axis to z = — ib creates a trapping point at 
2 = ijb. The case of b = 1 has been analyzed, and the re- 
sults are shown here. The electric and trapping potentials 
are plotted in Figs. 7 and 8. The scaling factor V is 1/4, 
and the saddle point is on the y-axis at y s — 1.839 with 
G s = 0.0728. The electric potential is 


2 v _ x 4 + V 4 + 2 (y 2 - x 2 + x 2 y 2 ) + 1 = 2 

x 2 + y 2 


x 4 + y 4 + 2(y 2 - x 2 + x 2 y 2 ) + 1 
( x 2 + y 2 + 2y + l) 2 


and the various intercepts. For the center rod (B 1 > 1), 
the relation between the B value and the dimensionless 
diameter is 


and the x intercepts of the outer rods still follow the re- 
lationships of Eqs. (4) and (5), with B replaced by \/B\ 
(for B\ < 1). The y intercepts of the low T er rod obey the 
relationship below (for B 2 > 1): 


di ~ B 1 





2 

B 1 


2/0,1 


— B 2 ± \/ < lB 2 ~ 1 

b 2 ~i 
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and Eq. (5). Once again, the voltages applied to the rods 
will not be equal and opposite unless the diameters are 
adjusted so that B x - 1/S 2 * The endcaps need only DC 
bias because the potential is zero at the trapping point. 

The small value of T , which is 1/4, makes the drive 
voltage for the outer rods eight times that of the four-rod 
trap for the same ion resonance frequency. This fact, plus 
the small well depth, may limit the usefulness of this trap. 
In both three-rod traps, however, the low saddle point lies 
only on one side of the trapping region. The potential 
barrier on the opposite side is much higher. These traps 
may be useful for trapping macroscopic particles, where 
gravity plays an important role. The trap rods could be 


oriented horizontally so that the low saddle point lies above 
the trapping region. 

V. Conclusions 

This article has presented a simple model for linear 
ion traps that permits the accurate calculation of trapping 
parameters and gives a detailed picture of the potentials. 
The model was used to analyze some new trap geometries, 
and their advantages and disadvantages were discussed. 
Each trapping geometry is characterized by the two pa- 
rameters T and G s , which determine the dependence of its 
natural resonance frequency and well depth on the applied 
RF voltage and trap dimensions. 
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Fig. 1. The ponderomotlve potential function G for a four-rod 
quadrupole trap Is plotted in three dimensions. The singularities 
are truncated at the value 1.0 for clarity. 



- 3.0 - 2.5 - 2.0 - 1.5 - 1,0 - 0.5 0 0.5 1.0 1.5 2.0 2.5 3.0 


Fig. 2. The electric potential for a four-rod trap is shown in this 
contour plot. The range of potentials shown Is —1 to 1 at intervals 

of 0.25. 


- 2.5 - 2.0 - 1.5 - 1.0 - 0.5 0 0.5 1.0 1.5 2.0 2.5 

Fig. 3. The electric potential for a two-rod trap is shown in this 
contour plot. The range of potentials shown Is —1 to 2 at Intervals 
of 1/3. 



Fig. 4. The ponderomotlve potential function G for a two-rod 
quadrupole trap is plotted In three dimensions. The singularities 
are truncated at the value 1.0 for clarity. 





-3.0 -2.5 -2.0 -1.5 -1.0 -0.5 0 0.5 1.0 1.5 2.0 2.5 3.0 




3.0 -2.5 -2.0 -1.5 -1.0 -0.5 0 0.5 1.0 1.5 2.0 2.5 3.C 


Fig. 5. The electric potential for a three-rod trap is shown In this , , 

contour plot. The range of potentials shown is -1 to 1.5 at inter- "fl- 7 ; The electric potential for a three-rod trap with zero net 

vals of 0 25 line c ^ ar 9 e * s shown * n this contour plot. The range of potentials 


shown is —2 to 4 at intervals of 0.5. 
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Fig. 6. The ponderomotive potential function |F 2 | for a three-rod 
quadrupole trap Is plotted in three dimensions. The singularities 
are truncated at the value 0.5 to emphasize the potential well at 






Fig. 8. The ponderomotive potential function | F 2 /4| for a three- 
rod quadrupole trap with zero net line charge Is plotted In three 
dimensions. The singularities are truncated at the value 0.1 to 
emphasize the potential well at z — 
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This article develops and analyzes oscillator configurations which reduce the 
effect of 1/f noise sources for both direct feedback and stabilized local oscillator 
(STALO) circuits , By appropriate use of carrier suppression, a small signal is gen- 
erated which suffers no loss of loop phase information or signal-to-noise ratio. This 
small signal can be amplified without degradation by multiplicative amplifier noise, 
and can be detected without saturation of the detector. Together with recent ad- 
vances in microwave resonator Qs, these circuit improvements will make possible 
lower phase noise than can be presently achieved without the use of cryogenic de- 
vices. 


I. Introduction 

Phase fluctuations in microwave oscillators show a 
characteristic l// 3 spectral density for frequencies very 
close to the carrier. The spectral density S^(/) can be 
expressed in terms of rad 2 per Hz bandwidth at an offset 
frequency / from the carrier. At larger frequency offsets 
(/ > 10 kHz), fluctuations decrease more slowly, typ- 
ically approaching a more or less constant value for fre- 
quencies of 100 kHz and above, This article is concerned 
with reduction of the fluctuations very close to the carrier 
by the use of circuit techniques not previously applied to 
microwave oscillators. 

In order to achieve a microwave signal with the lowest 
possible noise at all offset frequencies, a variety of tech- 


nologies is presently used [1, 2, 3]. Typically, high stabil- 
ity for small offsets (/ < 100 Hz) is obtained by locking 
the microwave oscillator to a harmonic of a low-frequency 
(5 MHz) bulk acoustic wave (BAW) quartz-crystal oscilla- 
tor. For offset frequencies in the range of 100 Hz < / < 
10 kHz, further stabilization may be provided by a surface 
acoustic wave (SAW) oscillator operating at » 500 MHz. 
The microwave oscillator itself provides the best possible 
stability only for relatively large offset frequencies (/ > 
10 kHz). 

While all the sources listed above show l// 3 type 
phase fluctuations for small values of offset frequency /, 
the microwave oscillator itself shows by far the highest 
noise. This is due to two contributing factors: the low 
Q available for microwave resonators and the large 1 // 
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phase noise in available microwave devices. (The action 
of the oscillation loop converts this l/f phase noise into 
1 // frequency noise, which is mathematically equivalent 
to l/f 3 phase noise [1, 16].) Thus, while BAW and SAW 
quartz crystals show Qs of 10 6 and 10 5 respectively, mi- 
crowave cavities and dielectric resonators are limited to 
Qs in the range 1-3 xlO 4 . In like manner, active devices 
which are available at the lower operating frequencies of 
the quartz devices show l/f phase noise of —140 dB per 
IIz or lower at an offset frequency of / = 1 IIz, while the 
best 8-12 GHz (X-band) amplifiers have noise of -120 dB 
per Hz [13, 14]. 

Recently, a new type of microwave resonator has been 
demonstrated with Qs 10 to 1000 times larger than pre- 
viously available [4-11]. This sapphire whispering-gallery- 
mode resonator allows the intrinsic Q of the sapphire itself 
to be utilized by isolating the fields to the sapphire element 
and away from lossy metallic container walls. These res- 
onators have shown Qs of 2 x 10 5 at room temperature and 
3 x 10 7 at 77 K. Tests of an 8-GHz oscillator stabilized by 
such a sapphire resonator (a stabilized local oscillator, or 
STALO) show l/f 3 noise 10 dB lower than previously re- 
ported for any 8-12 GHz (X-band) source and only 22 dB 
higher than the best quartz-crystal stabilized oscillator [9, 
12]. If the device noise in this oscillator could be reduced 
by 20 dB or more, the need for quartz stabilization would 
be eliminated, and a new oscillator capability would be- 
come available. 

The following sections describe two substantially dif- 
ferent implementations of an idea in which negative feed- 
back in the oscillator is generated by means of a sup- 
pressed-carrier microwave signal. In one implementation, 
the signal is fed to a semiconducting phase detector to 
enhance its sensitivity while avoiding saturation. In this 
STALO configuration, phase-detector noise is effectively 
reduced by the enhanced sensitivity. In the second im- 
plementation, direct RF feedback of a signal with sup- 
pressed carrier induces both oscillation and negative phase 
feedback. In this case, the degree of improvement in l/f 
phase noise over that which characterizes the amplifier it- 
self is equal to the degree of negative feedback which can 
be achieved without oscillation in unwanted modes. 

A comparison of the two implementations shows the 
second (direct RF feedback) to be somewhat trickier in 
concept (involving both negative-phase feedback and 
positive-amplitude feedback in the same loop), and sim- 
pler in realization. Conditions of stability require the use 
of a filter with a performance that is expected to limit 
the usable loop gain to the 20-30 dB range. The STALO- 
type implementation, while more elaborate, is somewhat 


less tricky, and should allow larger gains to be used. Both 
should show crystal-oscillator-type performance in a room- 
temperature 10-GIIz (X-band) oscillator using a whisper- 
ing-gallery-mode sapphire resonator with an intrinsic Q of 
2 x 10 5 , and allow dramatic improvements in the state of 
the art with cooled resonators and higher Qs. 

II. Background 

Figures 1 and 2 show conventional microwave self- 
excited oscillator and STALO configurations, together 
with an identification 9 f the in-oscillator and oscillator out- 
put noise spectral densities. In the self-excited oscillator 
shown in Fig. 1, the oscillation condition requires that the 
phase shift around the complete feedback loop comprising 
the amplifier, resonator, and interconnections be a multi- 
ple of 2 7 T. With this condition satisfied, any phase fluc- 
tuation in the microwave amplifier must be accompanied 
by an opposite shift of equal magnitude in the resonator. 
For slow phase fluctuations (/ <C *VQ)> characteris- 
tic phase slope of the resonator 6 <j)/8v = 2 Q/v implies 
a corresponding slow fluctuation in the frequency of the 
oscillator. Here / represents the fluctuation frequency, v 
the microwave frequency, Q the quality factor of the res- 
onator, and <j> the phase of the microwave signal. In this 
way, a power spectral density of phase fluctuations for the 
amplifier S$(f) \ a mp results in oscillator output frequency 
noise 

Sy(f) lout — (2Q) 2 S^(/) lamp 
or the mathematically equivalent output phase fluctuations 

S<f>(f) |out = (2Q) 2 S<t>(f) \arnp (1) 

where y = hv/v is the fractional frequency deviation. 

Figure 2 shows the schematic diagram for a STALO 
in which the frequency variations of a noisy microwave 
source are cancelled by a feedback loop that detects the 
consequent phase shifts across a high-Q resonator to gen- 
erate a frequency-correction voltage. The phase from the 
reference loop is adjusted to produce the proper sign of 
the correction voltage and attain maximum sensitivity by 
operating in the high slope region of the mixer output ver- 
sus reference phase relation. In the limit of large loop gain, 
stable equilibrium requires that the phases at the two input 
ports of the mixer be in quadrature (mixer output = zero). 
A significant advantage of the STALO is that the proper- 
ties of the feedback loop are particularly easy to control, 
since the signal is mixed down to baseband (near zero fre- 
quency). This allows the use of active filters with narrow 
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bandwidths and sophisticated response shapes which are 
not possible at microwave frequencies. A second advantage 
is that the 1/f noise for X-band mixers (—135 dB // per Hz 
at 10 GHz) [15] is better than that which is available from 
the best amplifiers (—110 to —120 dB//) [13, 14]. The 
analysis from the self-excited oscillator can be adapted to 
the STALO by noting that the only difference is that the 
phase detection and correction has been moved from the 
resonator in Fig. 1 to the mixer-amplifier-oscillator combi- 
nation in Fig. 2. The phase noise of the RF amplifier of 
Fig. 1 is replaced by that of a mixer. Consequently, the 
output phase fluctuations are described by 

Sdf) U = (2 QY 2 (j) S+{f) u„ 

As a consequence, the performance of a direct-feedback 
oscillator with an amplifier having 1// noise of —120 dB 
per nz at 1 Hz (5^(/) lamp = 10 -12 // rad 2 per Hz) is 

U=io- 12 (2Q)- 2 0!) 

while a STALO using a mixer with 5^(/) | mtx = 10“ 13 5 // 
rad 2 per Hz will be 15 dB quieter. These device noise lev- 
els represent the quietest components presently available, 
giving a clear advantage to the STALO configuration. 

III. Oscillator Configurations for 
Reduced Noise 

Three new oscillator configurations for low phase noise 
are detailed in this section. The first two are STALO con- 
figurations that reduce the effect of mixer noise by increas- 
ing its sensitivity by the use of a suppressed-carrier signal. 
Of these two, the first achieves increased sensitivity by op- 
erating the high-Q resonator at higher power than would 
otherwise be possible. In the second, a similar effect is 
achieved by use of a low-level RF amplifier. The third 
oscillator configuration uses direct RF feedback of a sort 
that simultaneously achieves positive-amplitude feedback 
and negative-phase feedback. In this way excess gain of 
the amplifier is used to reduce its effective phase noise. 

The STALO shown in Fig. 3 forms the basis for the 
new designs. It differs in implementation from Fig. 2 in 
that the signal from the cavity to the mixer is not taken 
from a second coupling port but is instead taken from the 
signal reflected from the input port. A circulator separates 
this signal from the forward-driving signal. At critical cou- 
pling and on resonance, the returned signal is identically 


zero. However, it is the superposition of two equal signals, 
one of which emanates from the cavity, and a second, re- 
flected signal which is derived from the driving signal with 
a constant phase shift. This reflected signal does not sig- 
nificantly affect the operation of the mixer at resonance 
since it is in quadrature with the signal at the other mixer 
port. Thus the two STALOs will have approximately iden- 
tical performance. Figure 4 shows the returned signal for 
small errors in local oscillator frequency. While the ampli- 
tude goes through zero on resonance, a phase reversal takes 
place in which the in-phase signal on one side becomes 
out of phase on the other, allowing a linear dependence of 
mixer output voltage on the frequency error, as required 
for effective feedback. Instead of viewing the mixer as a 
phase detector, it is seen as projecting the component of 
the signal at the r input onto the phase of that at the / 
input. 

A. STALO Design for Enhanced Phase 
Detector Sensitivity 

In this configuration, enhanced sensitivity in the phase 
detector is achieved by means of relatively high power in 
the high-Q resonator. It has the advantages of simplic- 
ity and absence of any amplifier to introduce added phase 
noise if carrier suppression is incomplete. A disadvantage 
is that power limitations in the microwave source or high-Q 
resonator restrict the available improvement factor. 

As previously discussed and as shown in Fig. 5(a), sup- 
pression of the carrier at the r port of the mixer in Fig. 3 
has only incidental consequence regarding mixer sensitiv- 
ity, since the suppressed part of the signal is in quadrature 
with the reference signal at the / port. (The part of the 
signal due to frequency variations, is in phase with 

the reference and so is detected in any case.) However, 
things are not quite identical to the conventional STALO 
shown in Fig. 2. Suppression of the carrier at the r port 
allows the power to the high-Q resonator to be increased 
without saturating the mixer. This increased power results 
in an enhanced sensitivity of the mixer output voltage to 
frequency variations 

Figure 6 describes such a circumstance. Besides the 
increased power levels in oscillator and resonator, the only 
difference from Fig. 3 is an appropriately weaker coupling 
to the mixer’s l port. Mixers typically saturate at signal 
levels on the order of 20 milliwatts, while frequency sources 
and resonators can operate at power levels up to 1 watt or 
even higher. The resultant increase in sensitivity of up to 
17 dB reduces the consequence of mixer noise by the same 
factor. For a mixer with flicker noise of — 135 dB per Hz at 
1 Hz offset, the effective noise could be reduced to a value 
of —152 dB per Hz. 
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B. STALO Design Using RF Amplification 

Figure 7 shows a further modification of the STALO 
which can result in improved performance. Here the small, 
nominally zero signal returned from the resonator is ampli- 
fied before it enters the mixer. Two effects of this addition 
are easy to understand. The loop gain will be increased by 
the added gain, an effect which may be compensated for in 
the design of the baseband amplifier. Secondly, the gain of 
the amplifier will increase the sensitivity of the mixer out- 
put to phase error in the resonator without significantly 
affecting mixer noise. Thus the effective mixer phase noise 
is reduced by the amount of amplifier gain. This can be a 
very substantial improvement. 

The third effect of this modification is a little more 
complicated. Since amplifiers are somewhat more noisy 
than mixers (—120 dB versus —135 dB at 1 Hz offset as 
previously discussed), a crucial point is the proper analysis 
of the contribution of amplifier noise. The kind of noise 
under discussion is not additive noise, which would be inde- 
pendent of any large signal also present, but instead is mul- 
tiplicative noise, which transforms a large signal by slightly 
modifying its amplitude and phase. (Additive noise in 
good amplifiers is insignificant except at offset frequencies 
/ >^ 10 kHz where the l/f multiplicative noise is rela- 
tively small.) Figure 5(a) shows in phasor form the cavity 
signals with and without carrier suppression corresponding 
to the oscillators shown in Figs. 3 and 2, respectively. The 
added signals due to small frequency variation in the local 
oscillator are also shown. These added signals are detected 
by the mixer in order to allow feedback circuitry to can- 
cel the frequency variations. The effect of multiplicative 
phase noise in the amplifier for the two cases is shown in 
Fig. 5(b). It is clear from Fig. 5(b) that this noise source 
generates signals which are indistinguishable from those 
caused by actual frequency variations, and which are due 
only to the presence of the coherent carrier. Thus a reduc- 
tion in amplifier noise is effected which is proportional to 
the degree of carrier suppression of the microwave signal 
at its input. 

Oscillator phase noise is thus determined by a com- 
bination of mixer noise (reduced by amplifier gain) and 
amplifier noise (reduced by the degree of carrier suppres- 
sion). For example, if mixer noise of —135 dB// per Hz 
were reduced by 25 dB of amplifier gain to —160 dB// 
per Hz, and amplifier noise of —120 dB// per Hz were re- 
duced by 40 dB of carrier suppression to the same value, 
the combined noise of -157 dB// per Hz would determine 
oscillator performance. For a loaded Q of 10,000 (intrinsic 
Q = 20,000), this would allow an oscillator phase noise 
of S<^f)\osc = —43 dB// 3 per Hz, a value superior to any 


room-temperature microwave oscillator to date. For room 
temperature and thermoelectrically cooled sapphire res- 
onators with Qs of 10 5 and 10 6 , performance would be 
superior to that of any available source at S^f) | 0iC = 
—63 dB// 3 per Hz and -83 dB// 3 per Hz, respectively. 

C. Oscillator With RF Feedback 

1. Noise reduction. Figure 1 shows a conven- 
tional microwave oscillator excited by direct RF feedback. 
As discussed earlier, any slow phase shift in the amplifier 
is converted by the feedback process into a frequency shift 
of the oscillator output as required by the condition of con- 
stant phase shift around the loop (<f>i 0 op = 2n7r), with the 
conversion constant depending on the resonator Q. The 
total signal returned from the input port to the resonator 
is the superposition of two parts, a reflected constant sig- 
nal equal in magnitude to the input signal, and an emitted 
part proportional to the instantaneous RF amplitude in 
the resonator. In the previous section, the critically cou- 
pled case was discussed, where on resonance the net re- 
turned signal was identically zero. If instead the cavity is 
slightly over-coupled, so that the signal emitted from the 
resonator is larger than the reflected signal, the net signal 
returned from the resonator will not be zero on resonance 
but will have a small, constant value. Figure 8(a) shows 
the configuration for an oscillator in which the small re- 
turned signal is amplified and returned to the cavity to 
induce oscillation. RF feedback of a phase and magnitude 
that allows oscillation on resonance will also induce neg- 
ative phase feedback that reduces the effect of amplifier 
phase fluctuations on the oscillator frequency. 

Figure 9(a) shows these various signals in phasor form 
for the case Q 1, with weak magnetic coupling to the 
resonator achieved by means of an iris at the end of a 
waveguide or by a shorted coaxial line. Shown are the 
forward signal voltage amplitude f, reflected signal r (di- 
rectly reflected by the coupling port), emitted signal e, 
and net returned signal n, as shown for the condition of 
resonance (test frequency = resonator frequency.) Signals 
are measured at the effective plane of the weak coupling 
port. For frequencies significantly outside the bandwidth 
of the resonator, the net returned signal is approximately 
equal to r; thus, out-of-bandwidth oscillation is prevented 
by the phase reversal between n and r. In order to achieve 
oscillation at resonance, the gain and phase shift around 
the loop must regenerate f from n. For the slightly over- 
coupled case shown, this corresponds to a net phase shift 
2 n 7 r, and gain to overcome the signal amplitude reduction 
| f/r |. While small losses due to transmission through 
various circuit elements must also be made up by the gain 
element, they can be ignored for this analysis. 
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Intrinsic and external Qs, Qi and Q e) describe the 
effect of resonator and coupling losses and combine to form 
the loaded Q\ 

qt 1 = qt 1 +q:' 

which defines the operational bandwidth of the resonator. 
Using standard circuit analysis, the amplitude of the res- 
onator response to the forward signal can be written: 

i g i_ h .. 1 (2) 

Q,W) 

where q = Qi/Q e is a loading factor, v Q is the resonance 
frequency, and Si/ is the frequency deviation from reso- 
nance. The phase of the resonator response is similarly 
given by 

tan(^) = (3) 

Vo 


°=£ +I m 

For the oscillator shown in Fig. 1, a small phase shift 
0 in the amplifier gives a corresponding phase shift —6 
across the resonator, with a resultant frequency shift given 
by the phase slope in Eq. (4). The configuration shown in 
Fig. 8(a) gives a reduced resonator phase shift and thus 
reduced frequency shift. This reduction is now calculated. 
Figure 9(b) shows the self-consistent phasor diagram for 
the oscillator of Fig. 8(a), with a slight amplifier phase 
shift 0. The instantaneous frequency is determined by the 
smaller angle <f > , together with the resonator phase slope 
2 Q\/v 0 \ thus, the ratio <t>/6 describes the phase noise reduc- 
tion of Fig. 7 compared to Fig. 1. The diagram is oriented 
so that the direction of e is constant. Graphically solving 
n = e 4- F shows that the effect of a rotation 0 between 
n and f due to the amplifier gives an angle <f> between f 
and e. The angle <f> in turn determines the frequency shift 
via Eq. (3). A straightforward evaluation for the geometry 
shown gives 


for a slope at resonance of 

M _ 2Qi_ / 4 x 

6v v 0 

Now calculate the amplifier gain required for oscilla- 
tion at resonance (Si/ « 0). From the nature of the cou- 
pling, 


r = -f 

and from the equation above 


r=f x 


2? 

7 + 1 


These can be combined to give 


(5) 


9 = <f> -b sin 1 (-4-y x sin(<^ 

which can be approximated in the limit of small angles to 
give 

<t>/0 « W r (8) 

1 -I- pry 

I n | 

This approximation will hold to high degree of accuracy 
because of the very small value of the phase fluctuations 
involved. Using Eqs. (6) and (5), Eq. (8) can be rewritten 
in terms of the amplifier gain as 


< f )/0 & 


1 

l + G 


n = f -j- e = 


f x 


<7-1 

i 


requiring a gain of 


The factor <j>/9 describes the reduction in phase vari- 
ation across the cavity compared to that across the am- 
plifier. It also describes the improvement in performance 
due to the circuit in Fig. 8(a). Combining this result with 
Eq. (1), the phase-noise performance of the oscillator is 


_ I f I _ i± i 

| n | q- 1 


( 6 ) 


s+w u=(m 2 (‘2Qi) 


-2 


(y) 2 ^(/) 


amp 


for oscillation. The over-coupled condition depicted in 

Fig. 9 corresponds to q > 1. If q = 1 + 6q, the gain For example, for G - 10 (20 dB of amplification), <f>/6 is 
requirement can be rewritten as 1/11 (22 dB of noise reduction). If the amplifier has noise 
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of —120 dB // per Hz (or S^(/) | amp = 10 12 // rad 2 per 
Hz), oscillator performance will be 

St(f) | ou « = 0.8 x 10 -14 (2Q/) -2 (^J) 

which for a frequency of 10 GHz and Qi of 10 5 gives 


a narrow bandwidth is required. For the example here, 
with a path length of 10A and for G = 10 (20 dB), fil- 
ter attenuation of 20 dB must be achieved at a frequency 
offset of 2.5 percent, where the circuit length alone intro- 
duces a 90-deg shift. This requires a filter bandwidth of 
« 0.25 percent, a value that can be achieved with low loss 
using conventional techniques. 


St(f) \out = 2 x 10 - 5 // 3 

or -47 dB per Hz at /= 1 Hz offset frequency. For this 
same case, Eq. (7) shows that the degree of over-coupling 
required is 


or 


and 


6q = 


2 

G- 1 


2 

9 


Q± 

Qe 


1 + 6q 


n 

9 


O ' = qFW = O ‘ x T 57^ = 045xO ‘ 


2. Loop stability. The oscillation condition for the 
direct RF feedback configuration is such that the directly 
reflected signal has a net phase shift of 180 deg with re- 
spect to the signal emitted from the resonator. Thus the 
loop will not oscillate at frequencies outside the passband 
of the stabilizing resonator (where very little signal is ab- 
sorbed or emitted by the resonator), unless the phase is 
shifted by other elements in the circuit. Unfortunately, 
the path length of the circuit alone is sufficient to add 
such a shift if the frequency is slightly varied. Thus, for a 
path length of 10A, a 180-deg phase shift will occur with 
a 5-percent frequency change. Even worse, at these ±5- 
percent frequencies, the loop gain will be nearly G, a value 
much greater than the loop gain at the desired mode (ap- 
proximately unity). These oscillations do not involve any 
high-Q resonance, and can only be prevented by the in- 
troduction of a filter whose function is to reduce the gain 
(as the frequency is varied away from i/ 0 ) to a value less 
than unity by the time the phase shift due to all sources 
reaches 180 deg. Because each stage of a filter introduces 
a phase shift of £3 90 deg by the time substantial attenu- 
ation is achieved, a single-stage filter must be used. The 
phase margin for the rest of the circuit is thus reduced to 
ss 90 deg by the filter’s presence. Because a single-stage fil- 
ter attenuates relatively slowly as the frequency is varied, 


The expressions for amplitude response and phase 
shift for a single-stage filter are identical to those already 
presented in Eqs. (2) and (3). The shift due to a transmis- 
sion line is given by 


where p is the effective path length. Figure 8(b) shows an 
oscillator with the added filter; Figs. 10 and 11 show the 
amplitude and phase response for a single-stage filter with 
Q = 3000, and the added phase due to an effective path 
length of 1 m. For Fig. 11, an attenuation of 35 dB is gen- 
erated before the total phase shift reaches 180 deg, thus 
allowing < 35 dB of loop gain for the reduction of ampli- 
fier phase noise. It can be seen from the figures that the 
attenuation of this filter is nearly 35 dB by the time the 
total phase variation due to filter and transmission path 
reaches 180 deg. While 3000 is a relatively high Q for a 
single-mode filter, it can be attained with low loss. It is 
clear that usable gain and consequent usable noise reduc- 
tion of 20 to 30 dB can be achieved with this technique. 


IV. Conclusions 

Typical low- noise 10-GHz (X-band) oscillators use a 
single transistor or other active semiconducting device for 
excitation; while a more elaborate STALO configuration is 
used for the lowest possible phase noise. With the develop- 
ment of sapphire whispering-gallery-mode resonators with 
Qs above 10 5 to 10 7 at 10 GHz, the possibilities have been 
considerably enhanced. Whereas lower frequency SAW or 
BAW quartz-crystal oscillators had far lower noise than 
their higher frequency counterparts, they are rivalled by 
an X-band oscillator using the sapphire resonator [9]. To- 
gether with the improved oscillator circuits developed here, 
such a resonator may make possible close-in phase noise 
lower than that of any noncryogenic frequency source. Fur- 
thermore, cooling by means of thermoelectric coolers or 
liquid nitrogen may make practical frequency sources with 
greatly reduced phase noise. 

New design configurations for STALOs and direct RF 
oscillators allow reduced phase noise in comparison to con- 
ventional configurations. By appropriate use of carrier sup- 
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pression, a small signal is generated which suffers no loss 
of loop- ph cise information or signal-to-noise ratio. This 
small signal can be amplified without degradation by mul- 
tiplicative amplifier noise, and can be detected without 
saturation of the detector. 

Figure 12 shows phase-noise calculations for an im- 
proved sapphire whispering-gallery-mode oscillator and 


STALO for configurations as shown in Figs. 8 and 7. Use 
of a cryogenic sapphire resonator allows a further improve- 
ment of 20 to 43 dB. Quality factors are assumed to be 
Qi = 2 x 10 5 at room temperature, Qi = 2 x 10 6 at 170 K, 
and Qi = 3 x 10 7 at 77 K. Noise plots for various con- 
ventional 10-GHz frequency sources are also shown. The 
multiplied 5-MHz crystal oscillator presently represents the 
best performance available at X-band. 
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AMPLITUDE 




Fig. 1. Block diagram of simple oscillator with direct RF feedback. 
Output phase noise is derived from amplifier noise together with 
phase slope of resonator; phase is adjusted to give 2 n tt radians 
around loop at the center of the sapphire resonator passband. 


OUTPUT 



Fig. 2. Block diagram of stabilized local oscillator (STALO) with double-balanced mixer type 
phase detector. Mixer noise plays the same role as amplifier noise in Fig. 1 ; phase Is adjusted 
to give / and r signals in quadrature. 
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OUTPUT 



Fig. 3. STALO with different configuration but functionally identical to that of Fig. 2. Signal 
returned from resonator Is superposition of the resonator signal and the constant reflected 
signal. 



Fig. 4. RF envelope of returned signal for critical coupling as 
frequency v Is varied. Phase Inversion at center allows linear de- 
pendence In mixer output with frequency for arbitrarily small fre- 
quency errors. 






{a) 


CARRIER 

SUPPRESSION 

SIGNAL 




HORIZONTAL SCALE FOR BOTH GREATLY ENHANCED 


Fig. 5. Phasor diagram showing the effect, with and without car- 
rier suppression, of (a) frequency error and (b) amplifier phase 
noise on the RF resonator signal. Also shown are the constant 
carrier suppression signal and mixer reference signal. Note that 
the effect of frequency error on the component of the RF resonator 
signal in phase with the reference Is unchanged by 20 dB carrier 
suppression, while the effect of amplifier phase noise Is reduced 
by 20 dB. 


30 



OUTPUT 



Fig. 6. Configuration to allow increased power into the high-0 resonator without saturating the 
mixer; the resultant increase in sensitivity of up to 17 dB reduces the consequence of mixer 
noise by the same factor. 


OUTPUT 



Fig. 7. STALO configuration for reduced phase noise. Critical coupling to resonator and 
operation very near allow Insensitivity to phase noise of amplifier; amplifier gain allows 
reduction of phase detector noise. 











Fig. 9(a). Phasor diagram showing forward, reflected, emitted, and 
net signal amplitudes f, r, e, and n, for a slightly over-coupled 
resonator at resonance. 



Fig. 8. Configuration of reflection oscillator: (a) with direct RF 
feedback, (b) added filler prevents spurious oscillation. 



Fig. 9(b). Self-consistent phasor diagram 
showing the effect of amplifier phase shift 
6 on operation of the oscillator shown in 
Fig. 7. Feedback gain is the same as in 
Fig. 9(a). 
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Fig. 10. Amplitude response of the single-stage filter In Fig. 8(b). 
More stages cannot be used because the added phase shift would 
allow oscillation within the filter bandwidth. 



Fig. 11. Phase response of the single-stage filter together with 
phase response due to path lengths, for path length of 1 m, Q = 
3000, and v Q = 8 GHz. Path-length Induced phase shift would be 
greater for lower Q or a longer path length. 



Fig. 12. Phase-noise calculations for Improved sapphire whlsperlng-gallery-mode oscillator 
and STALO shown in Figs. 8 and 7. Use of a cryogenic (170 K to 77 K) sapphire resonator 
allows further improvement by 20 to 43 dB. 
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This article presents an analysis of the effect of the line width of a single- 
longitudinal-mode laser on the frequency stability of a frequency reference transmit- 
ted over single-mode optical fiber. The interaction of the random laser frequency 
deviations with the dispersion of the optical fiber is considered to determine the- 
oretically the effect on the Allan deviation (square root of the Allan variance) of 
the transmitted frequency reference. It is shown that the magnitude of this effect 
may detennine the limit of the ultimate stability possible for frequency reference 
transmission on optical fiber, but is not a serious limitation to present system per- 
formance. 


I. Introduction 

Ultrastable fiber optic transmission of hydrogen maser 
reference signals is presently operational at the Goldstone 
facility of the NASA/JPL Deep Space Network [1]. This 
capability supports radio science experiments such as Con- 
nected Element Interferometry (CEI) by enabling phase- 
coherent arraying of widely separated antennas in real 
time. Also, distribution of a centralized maser reference 
throughout the entire complex eliminates the need for a 
hydrogen maser frequency standard at each Deep Space 
Station, with substantial cost savings and increased relia- 
bility. 

A reference signal produced by a hydrogen maser fre- 
quency standard is presently distributed over distances up 
to 29 km, with differential fractional frequency stability 
^ « 10' 15 for 1000-second averaging times. Although the 
present fiber optic distribution capability is as stable as the 


hydrogen maser frequency standard, the ideal distribution 
system should be an order of magnitude more stable than 
the distributed signal. With the promise of trapped-ion 
frequency standards [2] and superconducting cavity masers 
[3] that will both provide more stable frequency references, 
fiber optic link stability of 10“ 18 at 1000 seconds will be 
required for stable distribution. 

The demanding requirements that a frequency refer- 
ence distribution system must meet necessitate the exam- 
ination of all sources of instability at levels far beyond the 
needs of typical analog and digital fiber optic communi- 
cation systems. Presently, laser source amplitude noise 
and thermal variations of the optical fiber have been iden- 
tified as the limiting factors to distribution system per- 
formance. Improved lasers with lower intensity noise and 
single-longitudinal-mode operation will be employed in the 
near future. A thorough examination of fiber optic sys- 
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terns and components has indicated that laser frequency 
deviations may limit system performance as lower ampli- 
tude noise lasers become available. However, a quanti- 
tative analysis of the effect of laser frequency noise on a 
narrow-band frequency distribution system has not previ- 
ously been performed. 

The present analysis theoretically estimates the effects 
of laser frequency fluctuations on the amplitude and phase 
stability of a frequency reference transmitted on single- 
mode optical fiber. An expression for the phase noise spec- 
tral density of the modulation signal due to the frequency 
noise spectral density of the laser is derived and then used 
to calculate the expected Allan deviation of the trans- 
mitted reference signal. The laser-induced phase noise is 
shown to depend on the modulation signal frequency, fiber 
length, and fiber dispersion, as well as the magnitude of 
the laser frequency fluctuations. It is shown that the differ- 
ential frequency stability of a single-mode fiber optic link 
is fundamentally limited by laser frequency noise. Thus, 
as laser intensity noise is reduced, the laser frequency noise 
will limit transmission stability. 

II. Fiber-Laser Interaction 

The dispersion of optical fiber causes various optical 
frequencies to travel with different velocities. Optical car- 
rier frequency deviations couple with the dispersion of the 
fiber to produce random phase deviations in the envelope 
of a modulation signal, thereby degrading its phase sta- 
bility. Optical fiber acts as a frequency discriminator to 
translate random frequency deviations of the laser into 
random phase deviations of the RF modulation envelope. 
Although every effort is made to operate the laser at the 
minimum dispersion point of the fiber, the slope of the 
fiber index of refraction versus wavelength is typically not 
zero. As the laser frequency deviates, the signal experi- 
ences changes in the fiber index of refraction that cause 
phase shifts of the modulation envelope. 

Under bias current modulation, a semiconductor laser 
exhibits changes in wavelength, or chirp, that are syn- 
chronous with the bias modulation due to the change in 
refractive index of the laser gain medium [4]. Lasers also 
exhibit random frequency fluctuations due to the quantum 
phenomenon of electron-hole recombination in the gain me- 
dia, with an attendant change of refractive index [5]. Tem- 
perature excursions of the laser diode also affect the index 
of refraction and the lasing wavelength, but with time con- 
stants from minutes to hours. 

In digital transmission systems, laser chirp is the pre- 
dominant limit on transmission distance [6,7]. These wide- 


band systems are sensitive to phase deviations of the mod- 
ulation envelope at all frequencies. However, in such sys- 
tems, spontaneous emission noise is ignored since chirp is 
the overwhelming effect [7]. 

In contrast to digital or wide-band analog transmis- 
sion systems, frequency distribution systems employ a 
narrow-band loop filter at the output of the fiber receiver. 
Therefore, high-frequency deviations of the modulation en- 
velope phase are averaged out, leaving only the laser noise 
within the loop bandwidth. An analysis of the effect of 
close-to-carrier laser frequency noise on a long-distance fre- 
quency reference transmission system has not been pub- 
lished (to our knowledge), so the effects of laser frequency 
noise have not been known. Also, at the levels of signal to 
noise ratio (SNR) and frequency stability of the frequency 
distribution systems under consideration, it has been un- 
clear what role laser frequency noise plays in determining 
the ultimate system stability attainable. The present anal- 
ysis provides an estimate of the role of intrinsic laser fre- 
quency noise in a narrow-band frequency distribution sys- 
tem to determine the level at which system performance 
might become limited. 

Since high-frequency laser chirp can be neglected in 
a narrow-band system, the present analysis considers only 
the intrinsic laser frequency noise within the bandwidth 
of the output filter. As such, the analysis applies to any 
type of laser system, although semiconductor lasers are 
typically used. Externally modulated Nd:YAG lasers at 
1318 nm may be an attractive alternative to semiconduc- 
tor lasers for long-haul analog signal transmission. The 
present analysis applies equally well to these types of lasers 
by substitution of the appropriate parameters. 


III. Analysis 

Intrinsic laser frequency noise has its origins in the 
discrete random photons spontaneously emitted into the 
lasing mode that cause random frequency changes of the 
laser wavelength [8-10]. The high-frequency character of 
this noise is well-known. It is basically flat within the mod- 
ulation bandwidth, peaking at the relaxation oscillation 
resonance of the laser diode cavity, usually in the tens-of- 
gigahertz region [8, 9]. Within tens of kHz of the carrier, 
the frequency noise exhibits a 1// character [10]. Close-to- 
carrier measurements of laser noise are limited to within 
about 10 kHz, due to the physical difficulty of fabricating 
frequency discriminators with sufficient resolution at opti- 
cal frequencies. It is this low-frequency FM noise that is 
of interest for the analysis of narrow-band frequency dis- 
tribution systems. 
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We desire an expression for the phase noise spectral 
density of the modulation signal as a function of the spec- 
tral density of laser frequency fluctuations. The resultant 
phase noise density may then be used to calculate the ex- 
pected Allan deviation of the reference signal, provided the 
character of the laser frequency noise is known. 


and material composition of the fiber. The measured result 
for typical single-mode fiber at 1300 nm is [11] 

= 2 70-l m- 1 (6) 


Consider a single-longitudinal-mode laser coupled to 
a single-mode fiber. The laser output is amplitude modu- 
lated at RF frequency Q. The phase delay for the modu- 
lation signal envelope along the fiber is given by 


<t> = 


2 7vn LQ 
c 


(rad) 


(i) 


where n is the fiber index of refraction, L is the fiber length, 
and c is the speed of light in a vacuum. It is assumed 
that the laser operates in a single longitudinal mode at 
A = 1.3 fun. Now, consider the effect of a perturbation, 
such as a change in ambient temperature, on the refractive 
index of the fiber. This causes a phase change 


d<f> - 


dn 2tt L 
c 


(rad) 


( 2 ) 


Multiplying the numerator and denominator on the right- 
hand side by d\ gives 

2 tt LQd\ f dn\ 

d * = — —{dXj < nd > < 3 > 

By writing dX in terms of the laser frequency z/, the phase 
deviations may be expressed in terms of the laser frequency 
deviations, which have the same (random) time depen- 
dence. Thus 


Inserting this value into Eq. (5) and substituting the 
appropriate constants produces the simple relation 

S*(/) = S„(f) L 2 Q 2 (1.02 x 10~ 51 ) (rad 2 /Hz) (7) 

The above quantity is the mean-square phase-noise 
spectral density at an offset / from the modulation signal, 
fi. This is the spectrum which would be observed if a 
perfect oscillator (i.e., an oscillator with no phase noise) 
modulated the laser and if the output of the photodetector 
were compared to a second perfect oscillator, as in a phase 
noise measurement system. 


IV. Numerical Estimates 

The FM noise spectrum of distributed feedback-type 
(DFB) single-mode lasers typically used in fiber optic dis- 
tribution systems exhibits a power-independent 1// char- 
acter at low frequencies (below about 1 MHz). In the 
modulation band, the FM noise is white and inversely pro- 
portional to optical power [10]. The physical mechanism 
responsible for the 1// behavior is thought to be the trap- 
ping of carriers due to impurities and interfacial bound- 
aries, but it is not fully understood. The white portion of 
the spectrum is due to spontaneous-emission events and is 
adequately modeled by theory [5, 8]. 


1(/x , ,.27 tLQA 2 f dn\ / w x /.x 

d<j>(t) = du(t) f-yj (rad/sec) (4) 

For the analysis of frequency stability, it is more con- 
venient to look at the last expression in the frequency do- 
main by Fourier transforming as follows: 


The frequency noise spectrum of typical DFB lasers 
has been measured experimentally [10]. The above- 
mentioned physical mechanisms may be modeled as 

&(/) = £+ J (8) 


s+(f) = s„(/) 


— 2tt L Q A 2 dn 


t 2 


(rad 2 /Hz) (5) 


In this expression, S$(f) is the spectral density of the 
phase fluctuations at an offset frequency / from the RF 
signal; the fluctuations are induced by the spectrum of 
random frequency deviations, S u {f) } of the laser. 


The variation of the effective fiber index of refraction 
with wavelength depends on the waveguide parameters 


where P is the average output power of the laser and / is 
the frequency offset from the optical carrier. C and K are 
empirically determined constants. For the Fujitsu DFB 
laser diodes measured [10], C = 1.5 x 10 4 (IIz ■ W), and 
K = 5.8 x 10 11 (Hz 2 ). 

The frequency noise spectrum of a typical DFB laser is 
illustrated in Fig. 1. As laser power is increased, the white 
portion of the noise spectrum decreases proportional to 
P~ l , A numerical estimate of the additive RF phase noise 
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requires that only the l/f portion of S u (f) be inserted into 
Eq. (7), which gives 

S*(/) = 5,8 X /- -Q 2 L 2 (1.05 x 1(T 51 ) (rad 2 /Hz) 


The relation between the laser-frequency noise-limited 
Allan deviation of the 29-km link and the Allan deviation 
of a typical hydrogen maser is plotted in Fig. 2. This 
represents the ultimate frequency stability attainable with 
such a link, provided all other noise sources are negligible. 


= L 2 Q 2 — X / 10 -— (rad 2 /Hz) (9) 

The l/f laser frequency noise is converted to 1//, or 
“flicker,” phase noise in the fiber optic distribution system. 
This level of 1// phase noise depends on the inherent quan- 
tum fluctuations of the laser frequency and represents the 
ultimate phase noise floor of the system. 

For flicker phase noise, the Allan deviation (square 
root of the Allan variance) may be calculated from the 
following relation [12]: 

'•M = (10) 

where fh is the frequency cutoff of the phase noise. In this 
case, fh is one-half the bandwidth of the output filter. 


V. Present State of the Art 

The ultimate link stability plotted in Fig. 2 will only 
be attained if all other noise sources are negligible. In real- 
ity, other noise sources do contribute to the link stability. 
This is illustrated in Fig. 3, where an actual measurement 
of the 29-km Goldstone link is plotted in addition to the 
maser and the ultimate-stability-link curves of Fig. 2. 

In present-day systems, the Allan deviation 1/r inter- 
cept is set by the SNR of the fiber link, which is determined 
by the laser intensity noise. The SNR of a typical high- 
performance analog link is 120 dB/Hz. Figure 4 illustrates 
the 1-sec Allan deviation as a function of fiber length. It 
is immediately apparent from Fig. 4 that the laser fre- 
quency noise does not limit frequency distribution system 
performance at this time, since the laser SNR dominates. 
As lower amplitude-noise lasers become available, the laser 
frequency noise floor of the Allan deviation may begin to 
limit frequency reference distribution system performance. 


Substituting Eq. (9) into the last expression, the mod- 
ulation frequency cancels, and the expression for the Allan 
deviation reduces to 


<Ty( T ) 



L 2 6 x 10- 4O ln(8.88A 


0 


( 11 ) 


The laser-induced flicker phase noise thus sets the 
minimum bias level of the 1/r section of the Allan de- 
viation plot. The fact that the last expression does not 
depend on the RF modulation frequency, is significant. 
This implies that moving to higher or lower modulation 
frequencies for reference signal distribution will not alter 
the laser frequency noise “floor” of the Allan deviation. 

For purposes of comparison, we consider an actual fre- 
quency distribution link in use at the NASA/JPL Gold- 
stone Deep Space Communications Complex. The longest 
frequency distribution run is 29 km. Assuming that the 
output filter bandwidth, f hi is 10 IIz, the Allan deviation, 
calculated from Eq. (11), is 


, x 4.1 x 1CT 16 



( 12 ) 


Figure 5 depicts fiber link Allan deviation at 1 second 
as a function of link SNR. The laser relative intensity noise 
(RIN) sets the SNR of the fiber link for short distances [13]. 
Also shown is the Allan deviation floor due to laser fre- 
quency noise for a 29-km link. This plot shows clearly that 
laser frequency noise limits the frequency stability “floor” 
of the fiber link to 4 x 10“ 16 /r for SNR > 145 dB/Hz. 
Systems with as high as 150 dB/IIz SNR may be achiev- 
able with externally modulated high-power semiconductor- 
diode-pumped Nd:YAG solid-state lasers, or through the 
use of squeezed light generated directly from semiconduc- 
tor lasers. As these system improvements are realized, the 
low-frequency l/f FM noise of the laser may begin to limit 
system performance. 

A final observation: Since the fiber optic transmission 
system converts laser frequency noise to RF phase noise, it 
may be the case that a stabilized fiber optic link comprises 
a very accurate system for measuring the frequency devi- 
ations of lasers close to the optical carrier. This approach 
is under consideration for future research. 


VI. Conclusion 

At present, the noise floor of fiber optic distribution 
systems is determined by the laser signal to noise ratio 
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(SNR) in the RF modulation band. However, lasers with 
lower relative intensity noise (RIN) or those which use 
squeezed light promise increases in link SNR, and passive 
and active temperature-stabilization schemes can improve 
link stability at long averaging times. As these improve- 
ments in components and systems are realized, the funda- 
mental limit for frequency stability due to laser frequency 
noise may be reached. 

The present analysis provides the contribution to the 
phase noise of a transmitted frequency reference due to 
single-mode laser frequency deviations. Through interac- 
tion with the dispersion of the fiber, the 1// FM noise close 
to the optical carrier is converted to 1// phase noise close 
to the RF reference signal. The additive 1// laser-induced 
phase noise is a function of the fiber dispersion and length 
and determines the ultimate Allan deviation floor of the 


fiber optic distribution system in the 1- to 100-second re- 
gion. 

For the longest fiber optic frequency distribution link 
in the NASA/JPL Deep Space Network (29 km), using 
data for commercially available DFB lasers, the analysis 
indicates that the link Allan deviation is limited to 4 x 
10 16 /t* (for averaging times between 1 second and 100 
seconds). This stability limit will be reached at link SNR of 
145 db/IIz, which is 25 dB better than the present system. 

Further increases in SNR will not yield higher link sta- 
bility unless laser frequency noise is decreased as well. The 
laser FM noise stability limit is two orders of magnitude 
higher stability than the best current frequency standard, 
which indicates that laser frequency noise will not limit 
fiber optic frequency distribution capability in the foresee- 
able future. 
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Fig. 1. Typical DFB laser frequency noise spectrum. 


Fig. 2. Hydrogen maser and laser FM noise-limited fiber optic 
link frequency-stability comparison. 
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Fig. 3. Comparison of Allan deviation: H-maser, actual 29-km 
link, and theoretical FM noise-iimited link. 


Fig. 4. Comparison of frequency stability at one second versus 
fiber length for maser, fiber optic links of various SNR, and noise- 
limited laser FM. 
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Fig. 5. Comparison of fiber link frequency stability at one second 
versus SNR with laser FM noise-limited 29-km link. 
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Thermal Coefficient of Delay for Various Coaxial 

and Fiber-Optic Cables 

G. Lutes and W. Diener 

Communications Systems Research Section 


This article presents data on the thermal coefficient of delay for various coaxial 
and fiber-optic cables, as measured by the Frequency and Timing Systems Engineer- 
ing Group and the Time and Frequency Systems Research Group. The measured 
pressure coefficient of delay is also given for the air-dieiecfcric coaxial cables. The 
article includes a description of the measurement method and a description of each 
of the cables and its use at JPL and in the DSN. An improvement in frequency and 
phase stability by a factor of ten is possible with the use of fiber optics t 


I. Introduction 

Highly stable frequency and timing reference signals 
generated by atomic frequency standards enable the 
NASA/JPL Deep Space Network (DSN) to make precise 
measurements of relative time and position. These mea- 
surements are used to locate spacecraft and guide them 
to their destinations. They are also used to support ra- 
dio science, radio and radar astronomy, very long baseline 
interferometry, and geo dynamics. 

Within a Deep Space Communications Complex 
(DSCC), high-stability distribution systems are used to 
distribute the frequency and timing reference signals de- 
rived from the frequency standard to the subsystems that 
use them. These distribution systems use coaxial or fiber- 
optic cables as the transmission medium. Delay changes in 
these cables are often the major contributor to the phase 
and frequency instability of the distributed reference sig- 
nals. 


Detailed data on delay stability of cables is gener- 
ally not available from manufacturers. Since this infor- 
mation is needed to design frequency and timing distribu- 
tion systems, the Time and Frequency Systems Research 
Group and the Frequency and Timing Systems Engineer- 
ing Group have measured the thermal coefficient of delay 
(TCD) and the pressure coefficient of delay (PCD) for vari- 
ous coaxial and fiber-optic cables. This article presents the 
results of these measurements. A key finding is that an im- 
provement in frequency and phase stability by a factor of 
ten is possible, compatible with anticipated requirements 
on the DSN for supporting gravitational- wave experiments 
and connected-element interferometry. 

II. Background 

Delay changes degrade the phase and frequency stabil- 
ity of a signal transmitted through a transmission line [1], 
When the temperature of a transmission line changes, the 
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result is a corresponding delay change through the trans- 
mission line* A pressure change in an air-dielectric trans- 
mission line results in an additional delay change, since a 
dielectric constant change results from a pressure change. 


The TCD is a measure of delay change in a signal path 
that results from a temperature change. Its value is often 
given in parts per million per deg Celsius (ppm/deg C), 
which is the change in delay divided by the total delay 
normalized to 1 million units. In equation form it is 


TCD = 


At(l0 6 ) 

tAT 


( 1 ) 


where At is the change in delay through a signal path, t is 
the nominal delay through the signal path, and AT is the 
change in temperature. Similarly, the pressure coefficient 
of delay (PCD) is a measure of delay change in a signal 
path that results from a pressure change. Like TCD, the 
value of PCD is often given in ppm/psi. 


In practice, phase measurements can be made with 
much higher resolution and accuracy than direct delay 
measurements. Therefore, to obtain the data presented in 
this article, phase changes were measured and converted to 
delay changes. Since phase delay has a linear relationship 
to time delay, Eq. (1) can be re witten in terms of phase as 


TCD = 


A0(1O 6 ) 

QAT 


( 2 ) 


where A© is the change in phase delay through a signal 
path, and 0 is the nominal phase delay through the signal 
path. 

The TCD and PCD of various cables have been mea- 
sured, and the data presented in this article will enable 
designers to identify a cable with suitable stability perfor- 
mance for a particular application. 


111. Measurement Method 

Figure 1 shows a block diagram of the measurement 
system. A reference signal is separated by an RF power 
splitter into two signals. One of these signals is passed 
through the cable under test, and the other signal is used 
as a reference. The signal at the output of the cable under 
test drives one port of an RF phase detector. The reference 
signal from the RF power splitter passes through a man- 
ual RF phase shifter and drives the other port of the RF 
phase detector. A lowpass filter on the output of the RF 
phase detector eliminates the RF signals, leaving only the 
DC component, which is measured with a DC voltmeter. 


A temperature-controlled test chamber contains the cable 
under test. 

The phase detector’s sensitivity is measured in volts 
per deg phase, and must be calibrated before measure- 
ments can be made. For most phase detectors, the out- 
put voltage vs. phase curve, commonly called the S-curve, 
is sinusoidal if the proper input signal levels are applied. 
When this is the case, it is only necessary to measure the 
peak voltage out of the phase detector in order to calibrate 
it. The peak voltage is obtained by adjusting the manual 
phase shifter for 0 deg or 180 deg phase difference between 
the signals at the phase detector input ports. In terms of 
the peak voltage, the slope of the S-curve, in volts per deg 
phase, near zero volts (90 deg) is 


K e 



( 3 ) 


where V p is the peak voltage out of the RF phase detector. 
If the phase detector curve is not sinusoidal, it is necessary 
to determine the slope of the curve by other means. 


The measurements are made with the phase difference 
between the signals applied to the phase detector set near 
90 deg. This is where the output of the phase detector is 
near zero volts. For these measurements, the phase detec- 
tor output voltage vs. phase change is assumed to be lin- 
ear in this region. However, the total phase-delay change 
in the cable under test should be no more than ±30 deg 
over the test temperature range. This keeps errors due to 
the nonlinearity of the S-curve to less than 5 percent. A 
lower test frequency permits testing very long cables with- 
out exceeding a 30-deg phase-delay change over the test 
temperature range. 


To make the measurement, the temperature of the 
test chamber is set to a nominal value, usually 25 deg C. 
The manual phase shifter is adjusted to obtain a 90-deg 
phase difference between the signals at the ports of the 
RF phase detector. For this phase difference, the output 
voltage from the RF phase detector is near zero. Once this 
zero- volt reference is established, the temperature in the 
test chamber is changed in steps. For each temperature 
change in the test chamber, the voltage out of the phase 
detector changes, indicating a phase (delay) change. The 
value of the phase-delay change is 


A0 = 


A E 
Kn 


( 4 ) 


where AE is the change in voltage due to a temperature 
change. 
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When the voltage change stabilizes, its new value is 
recorded and the temperature in the test chamber is 
stepped to a new value. This can take from 20 min to 
1 hr per step. The temperature is normally changed in 
steps of 5 deg C. This usually results in a large enough 
phase change to be accurately measured. Yet the phase 
change is not so large that it results in significant error 
due to the nonlinearity of the phase-detector curve. 

From Eqs. (2), (3), and (4), the TCD in terms of the 
known and measured parameters is 


TCD = 


A£;(10 6 )(3 x 10 8 )a 
2fhrV p AT 


( 5 ) 


and in the DSN is described in this section. The TCD for 
each cable is given and the PCD is given for those cables 
that are normally pressurized. 

Loose-tube single-mode fiber-optic cable is used be- 
tween Deep Space Stations (DSSs) at the Goldstone 
DSCC. These cables, supplied by several manufacturers, 
use Corning single-mode fiber and are very similar in de- 
sign. Figure 3 depicts the general cable design. The per- 
formance characteristics are virtually identical for all of 
the manufacturers, and are given in Table 1 [2, 3, 4]. A 
graph of TCD with respect to temperature is shown in 
Fig. 4, which compares this type of cable to low-TCD op- 
tical fiber and the best coaxial cable (64-875). 


where a is the propagation constant for the cable, / is 
the measurement frequency used, l is the physical length 
of the cable, and AT is the change in temperature of the 
transmission line. 

A plot of the phase detector output vs. time and tem- 
perature for a 39.6-m length of RG-223/U cable is shown in 
Fig. 2. Each step in phase was the result of a temperature 
step of 5 deg C in the test chamber. The total tempera- 
ture range is from 35 deg C at the bottom of the plot to 
15 deg C at the top of the plot. The vertical scale shows 
50 mV per major division (50 mV/div), and the horizontal 
scale shows time in 1 hr per major division (1 hr/div). The 
phase detector’s peak output V p was measured at 1.06 V. 
The propagation factor a for the cable is given by the 
manufacturer as 0.659. A 100-MHz test frequency was 
used. Using the example of the step between 30 deg C and 
25 deg C, shown enclosed in dotted lines at the left side of 
Fig. 2, the change in voltage E is 1.2(5 x 10~ 2 ) = 6 x 10~ 2 . 
Evaluating Eq. (5) for this change in delay using the above 
information, 

(6 x 10~ 2 )(10 6 )(3 x 10 8 )(0.659) 
2(10 s )(39.6)tt(1.06)5 

= 90ppm/deg C at 27.5 deg C 
which is the average temperature of this step. 

IV. Results 

The cables that have been measured in the Frequency 
Standards Laboratory (FSL) at JPL are loose-tube single- 
mode fiber-optic cable, low-TCD fiber-optic cable, and 
metal-based coaxial cables RG-223/U, SF-214, F242-VV- 
2400- AOB, F645-EIA-5160-AO, IICC-12-50J, 64-500, and 
64-875. Each of these cables and its application at JPL 


A new optical fiber with a very low TCD has been 
tested in the FSL. This fiber is manufactured by Sumi- 
tomo Electric, and cables containing four fibers of this type 
are now being procured. They will be tested for distribu- 
tion of several types of signals in the DSN, including fre- 
quency references, time references, local oscillator signals, 
and intermediate frequency (IF) signals. Table 2 lists the 
physical and performance characteristics of this new fiber 
[5]. Figure 5 shows a graph of its TCD with respect to 
temperature. 

RG-223/U is a general-purpose coaxial cable in com- 
mon use at JPL and in the DSN. It is manufactured by 
a number of companies, including Times Wire and Ca- 
ble. Table 3 lists its important physical and performance 
characteristics, and Fig. 6 shows a graph of its TCD with 
respect to temperature [6]. 

SF-214 is also a general-purpose cable in common use 
at JPL and in the DSN. This cable is manufactured by 
Times Wire and Cable. It has lower loss than RG-223, 
and is used where this characteristic is important. It is also 
used in the antenna wrap-ups in the DSN where the cable 
must be flexed. Table 4 lists the important physical and 
performance characteristics of SF-214, and Fig. 7 shows a 
graph of its TCD with respect to temperature [6]. 

F242-VV-2400-AOB is a 3/8-in.-diameter coaxial ca- 
ble with a corrugated outer conductor for flexibility. It is 
used in the FSL for test applications where the cable must 
have low TCD and be flexible to accommodate various test 
configurations. It is manufactured by Flexco Microwave. 
Table 5 lists its important physical and performance char- 
acteristics [7], and Fig. 8 shows a graph of its TCD with 
respect to temperature. 

F645-EIA-5160-AO is a 1-in .-diameter coaxial cable 
with a corrugated outer conductor, manufactured by 
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Flexco Microwave. It was tested for possible use in the 
DSN. Table 6 lists its important physical and performance 
characteristics [7], and Fig. 9 shows a graph of its TCD 
with respect to temperature. The PCD of F645 cable with 
respect to air pressure is shown in Fig. 10. 

HCC-12-50J is a 1 /2-in. -diameter hardline cable used 
in the FSL where delay stability is critical but the cable 
is not flexed. It is manufactured by Cablewave Systems. 
Table 7 lists its important physical and performance char- 
acteristics [8] and Fig. 11 shows a graph of its TCD with 
respect to temperature. 

Two pressurized hardline air-dielectric cables, 64-500 
and 64-875, are used in the DSN where the lowest TCD 
and low attenuation are needed. The 64-875, a 7/8-in. - 
diameter cable, has better delay stability and lower atten- 
uation than the 64-500 cable, which has a 1/2-in. diameter. 
Ilowever, the 64-875 cable is considerably more expensive 
and harder to work with. These two cables were manu- 
factured by Prodelin, which was sold to Cablewave, who 
now manufactures them. Cablewave has informed the ca- 
ble Cognizant Operations Engineer (COE) that they will 
stop manufacturing these cables in the near future. Table 8 
lists the important physical and performance characteris- 
tics for 64-500 cable, and Figs. 12 and 13 show graphs of 


its TCD and PCD. Table 9 lists the important physical 
and performance characteristics of 64-875 [9], and Figs. 14 
and 15 show graphs of its TCD and PCD. 

V. Conclusion 

Several fiber-optic cables and coaxial cables used at 
JPL and in the DSN have been measured to determine 
their TCD. The PCD of the air-dielectric cables was also 
measured. The plots of TCD and PCD given here are 
meant to guide the user in choosing a coaxial or fiber-optic 
cable for use in applications requiring high delay stability. 
Many of the technical parameters needed to make tradeoff 
decisions are given. The costs of these cables as given 
in the tables are to be taken as a guide only. For very 
large quantities, e.g., tens of kilometers, the costs are tied 
largely to material costs and market conditions, and the 
cost of short lengths of cable may be three to four times 
the large-quantity cost. In the normal operating range of 
temperatures from 15 to 35 deg C, the new low-TCD fiber- 
optic cable with superior delay stability permits signals to 
be transmitted with unprecedented frequency and phase 
stability. For instance, compared to the best coaxial cable 
at 25 deg C, the use of low-TCD fiber would improve the 
frequency and phase stability of a transmitted signal by 
more than ten times. 
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Table 1. Physical and performance characteristics of loose-tube, single-mode, fiber-optic cable 


Cutoff wavelength 
Core concentricity 
Cladding diameter 
Coating diameter 
Core diameter 
Spot size 

Dispersion of 1285- to 1350-nm wavelength 

Optical loss, maximum 

RF bandwidth 

Number of fibers 

Nominal weight 

Maximum diameter 

Temperature range 

Maximum tensile rating 
During installation 
Long-term after installation 

Minimum bend radius 
During installation 
Free bend, installed 

Crush resistance, long-term installed 

Maximum vertical rise 

Price 


1130 to 1270 nm 

< 1 fim 

125 ±3jxm 
250 ±15|mn 
8.7^m 
10/xm 

< 3.5 psec/nm-km 
0.5 dB/km 
>100 GHz-km 

18 and 24 
165 kg/km 
15.1 mm 

—40 to 4-70 deg C 

2700 N 
600 N 

300 mm 
150 mm 

50 N/cm 
175 m 

$0.2 5/fiber- meter to $0.35 /fiber- me ter 


Table 2. Physical and performance characteristics of low-TCD, 
single-mode optical fiber cable 


Cutoff wavelength 
Core concentricity 
Cladding diameter 
Coating diameter 
Spot size 

Zero- dispersion wavelength 
Zero-dispersion slope 
Optical loss, maximum 
RF bandwidth 
Price 


1260 nm 

< 0.1 /xm 
125.4/xm 
815.0^m 
9.6jim 
1305 nm 

< 0.083 psec/nm-km 
0.32 dB/km 

>100 Gliz-km 
« $ 4/fiber-meter 



Table 3. Physical and performance characteristics of RG-223/U coaxial cable 


Inner conductor 

Dielectric 

Outer conductor 

Jacket material 

Cable outside diameter 

Minimum bend radius 

Weight 

Impedance 

Nominal capacitance 

Maximum operating temperature range 

Maximum operating voltage 

Propagation constant 

Nominal loss characteristics, dB/100 ft 


10 MHz 

1.35 

50 MHz 

3.0 

100 MHz 

4.3 

200 MHz 

6.0 

400 MHz 

8.8 

1 GHz 

16.5 

3 GHz 

36.0 

5 GHz 

51.0 

10 GHz 

85.0 


Price, < 300 meters 


Silver- covered copper; outside diameter 
0.035 in. 

Solid polyethylene; outside diameter 
0,116 in. 

Two shielding braids, silver-covered copper 
Black polyvinylchloride 
3/16 in. 

1.0 in. 

0.034 lbs/ft 

50 ohms 

30.8 pf/ft 

-40 to +80 deg C 

1900 volts RMS 

0.659 


$4, 50/m 


Table 4. Physical and performance characteristics of SF-214 coaxial cable 


Inner conductor 

Dielectric 

Outer conductor 

Jacket material 

Cable outside diameter 

Minimum bend radius 

Weight 

Impedance 

Cutoff frequency 

Nominal capacitance 

Maximum operating temperature range 
Maximum operating voltage 
Propagation constant 
Minimum recommended bend radius 
Nominal loss charact eristics, dB/100 ft 


10 MHz 

Not available 

50 MHz 

Not available 

100 MHz 

2.0 

200 MHz 

2.9 

400 MHz 

4.1 

1 GHz 

7.0 

3 GHz 

13.0 

5 GHz 

18.0 

10 GHz 

27.0 


Price, < 300 meters 


Seven strands of 0.0296-in. silver-covered 
copper; outside diameter 0.089 in. 

Solid polyethylene; outside diameter 
0.285 in. 

Silver-covered copper, braided flat round 
composites 

Black polyvinylchloride 
0.450 in. 

2.0 in. 

0.144 lbs/ft 
50 ohms 

13.7 GHz 

30.8 pf/ft 

-55 to +80 deg C 
5000 volts RMS 
0.659 
2.0 in. 


fa $20/m 
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Table 5. Physical and performance characteristics of F242-W-2400-AOB coaxial cable 


Inner conductor 
Dielectric 
Outer conductor 

Jacket material 

Cable outside diameter 

Minimum bend radius 

Weight 

Impedance 

Cutoff frequency 

Nominal capacitance 

Maximum operating temperature range 

Maximum operating voltage 

Propagation constant 

Nominal loss characteristics, dB/100 ft 


10 MHz 

Not available 

50 MHz 

Not available 

100 MHz 

2.5 

200 MHz 

4.0 

400 MHz 

6.0 

1 GHz 

8.5 

3 GHz 

15.0 

5 GHz 

19.5 

10 GHz 

37.0 

Price, < 1 km 



> 5 km 


Silver-covered copper; outside diameter 
0.081 in. 

Air; spline polytetrafluoroethylene spacer; 
outside diameter 0.200 in. 

Soldered strip-wound (corrugated) 
silver- covered copper; outside diameter 
0.330 in. 

Fluorinated ethylene propylene 
3/8 in. 

1.0 in. 

Not available 
50 ohms 
20 GHz 
Not available 
-55 to +200 deg C 
5000 volts RMS 
0.80 


$102/m 

$30/m 



Table 6. Physical and performance characteristics of F645-EIA-5160-AO coaxial cable 


Inner conductor 
Dielectric 
Outer conductor 

Jacket material 

Cable outside diameter 

Minimum bend radius 

Weight 

Impedance 

Cutoff frequency 

Nominal capacitance 

Maximum operating temperature range 
Maximum operating voltage 
Propagation constant 
Nominal loss characteristics, dB/100 ft 


10 MHz 

Not available 

50 MHz 

Not available 

100 MHz 

0.60 

200 MHz 

0.95 

400 MHz 

1.50 

1 GHz 

2.70 

3 GHz 

5.0 

5 GHz 

6.80 

10 GHz 

10.0 

Price, < 1 km 



> 5 km 


Stranded silver- covered copper 

Air; spline polytetrafluoroethylene spacer 

Soldered strip-wound (corrugated) 
silver-covered copper; outside diameter 
1.025 in. 

Fluorinated ethylene propylene 
1.065 in. 

5.5 in. 

Not available 
50 ohms 
6 GHz 

Not available 
-55 to +200 deg C 
5000 volts RMS 
0.79 


S3 $148/m 
S3 $10/ m 
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Table 7. Physical and performance characteristics of HCC-12-50J coaxial cable 


Inner conductor 

Dielectric 

Outer conductor 

Jacket material 

Cable outside diameter 

Minimum bend radius 

Weight 

Impedance 

Cutoff frequency 

Nominal capacitance 

Maximum operating temperature range 

Maximum operating voltage 

Propagation constant 

Nominal loss characteristics, dB/100 ft 


10 MHz 

0.26 

50 MHz 

0.59 

100 MHz 

0.85 

200 MHz 

1.3 

400 MHz 

1.8 

1 GHz 

2.9 

3 GHz 

5.0 

5 GHz 

7.0 

10 GHz 

10.0 

Price, > 5 km 



Copper-clad aluminum; outside diameter 
0.155 in. 

Air; spiral polyethylene spacer; outside 
diameter 0.338 in. 

Corrugated copper; outside diameter 
0.484 in. 

Black polyethylene 
0.618 in. 

5 in. 

0.16 lbs/ft 
50 ohms 
11.3 GHz 
Not available 
-55 to +80 deg C 
Not available 
0.915 


« $12/m 
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Table 8. Physical and performance characteristics of 64-500 pressurized hardline 

afr-dietectrlc cable 


Inner conductor 

Dielectric 

Outer conductor 
Jacket material 
Cable outside diameter 
Minimum bend radius 
Weight 
Impedance 
Cutoff frequency 
Nominal capacitance 

Maximum operating temperature range 
Maximum operating voltage 
Propagation constant 
Nominal loss characteristics, dB/100 ft 


10 MHz 

0.24 

50 MHz 

0.53 

100 MHz 

0.75 

200 MHz 

1.1 

400 MHz 

1.5 

1 GHz 

2.4 

3 GHz 

4.6 

5 GHz 

6.4 

10 GHz 

10.0 

Price, < 1 km 



> 5 km 


Copper-dad aluminum; outside diameter 
0.167 in. 

Air; six polyethylene tubes; outside 
diameter 0.456 in. 

Aluminum; outside diameter 0.530 in. 
Black polyethylene 
0.550 in. 

5 in. 

0.22 lbs/ft 
50 ohms 
Not available 
Not available 
-55 to +80 deg C 
3.4 volts RMS 
0.855 


fcj $35/m 
a $10/m 
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Table 9. Physical and performance characteristics of 64-875 pressurized hardline 

alr-dielectrlc cable 


Inner conductor 

Dielectric 

Outer conductor 

Jacket material 

Cable outside diameter 

Minimum bend radius 

Weight 

Impedance 

Cutoff frequency 

Nominal capacitance 

Maximum operating temperature range 

Maximum operating voltage 

Propagation constant 

Nominal loss characteristics, dB/100 ft 


10 MHz 

0.13 

50 MHz 

0.30 

100 MHz 

0.43 

200 MHz 

0.6 

400 MHz 

0.86 

1 GHz 

1.4 

3 GHz 

2.7 

5 GHz 

4.0 

10 GHz 

Not available 

Price, < 1 km 



> 5 km 


Copper-clad aluminum; outside diameter 
0.311 in. 

Air; six polyethylene tubes; outside 
diameter 0.837 in. 

Aluminum; outside diameter 0.953 in. 
Black polyethylene 
1.023 in. 

10 in. 

0.46 Ibs/ft 
50 ohms 
Not available 
Not available 
-55 to +80 deg C 
6.0 volts RMS 
0.855 


« $66/m 
~ $18/m 



Fig. 1. Block diagram of the system used to measure cable thermal 
coefficient of delay (TCD) and pressure coefficient of delay (PCD). 



Fig. 2. Example of recorded data for RG-223/U coaxial cable. 


POLYURETHANE 



Fig. 3. Construction of a typical loose-tube fiber-optic cable. 
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Fig. 4. A comparison of the measured TCD of loose-tube, single- 
mode, fiber-optic cable, the best coaxial cable (64-875), and low- 
TCD fiber-optic cable. 



Fig. 6. Measured TCD of RG-223/U coaxial cable. 











Fig. 8. Measured TCD of F242 coaxial cable. 



Fig. 9. Measured TCD of F645 coaxial cable. 
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Fig. 10. Measured PCD of F645 coaxial cable. 



Fig. 11. Measured TCD of HCC-12-50J coaxial cable. 
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AIR PRESSURE COEFFICIENT OF DELAY, ppm/psl THERMAL COEFFICIENT OF DELAY, ppm/°C 



Fig. 12. Measured TCD of 64-500 coaxial cable. 



Fig. 14. Measured TCD of 64-875 coaxial cable. 




Fig. 15. Measured PCD of 64-875 coaxial cable. 


Fig. 13. Measured PCD of 64-500 coaxial cable. 
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This article describes the performance of the all-digital da t a- transit ion tracking 
loop (DTTL) with coherent or noncoherent sampling . The effects of few samples 
per symbol and of noncommensurate sampling rates and symbol rates are addressed 
and analyzed . Their impacts on the loop phase-error variance and the mean time 
to lose lock (MTLL) are quantified through computer simulations. The analysis 
and preliminary simulations indicate that with three to four samples per symbol , 
the DTTL can track with negligible jitter because of the presence of Earth Doppler 
rate. Furthermore , the MTLL is also expected to be large enough to maintain lock 
over a Deep Space Network track. 


I. Introduction 

In modern digital communication systems, analog- to- 
digital conversion (ADC) is performed as far toward the 
front end as possible using available technology. Usually, 
the received signal is amplified and then downconverted to 
the appropriate frequency for digital conversion. There- 
after, various system functions are performed digitally, in- 
cluding carrier, subcarrier, and symbol synchronization, 
as well as signal detection and decoding. Depending on 
the application, either the baseband signals (inphase and 
quadrature) or the intermediate frequency (IF) signed can 
be sampled. Furthermore, the sampling clock can be free- 
running or controlled by the symbol-synchronization loop. 
In the latter case, the sampling clock can be adjusted to 
obtain an integer number of samples per cycle of the IF 
signal, or to obtain an integer number of samples per re- 
ceived symbol. All of these issues affect the final architec- 


ture and design of a receiver and influence the amount of 
cross-coupling among the various loops. 

Since sampling is done up front, the various track- 
ing loops need to be implemented digitally. The classical 
analog integrate-and-dump (I&D) filters, which are typi- 
cally part of the loop arms (inphase and quadrature), must 
be replaced by digital accumulators. This article inves- 
tigates the performance of the all-digital data-transition 
tracking loop (DTTL) with small noninteger numbers of 
samples per symbol. In the previous version of the Ad- 
vanced Receiver (ARX I) [1], the sampling was performed 
synchronously with the symbol rate, and a large number 
of samples per symbol were available. In the current ver- 
sion of the Advanced Receiver (ARX II), the sampling is 
performed asynchronously and the sample clock is indepen- 
dent of the symbol rate. At the highest required data rate 
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of 6.6 Msymbols/sec and the processing rate of 20 MHz for 
the Block V receiver, only about three samples per symbol 
are obtained. 

Some analytical results for the phase-error variance 
of the analog DTTL were first derived in [2], where the 
input was an analog signal and symbol and midphase de- 
tection were performed with analog I&D filters. Later, the 
analysis was reworked [3], taking into account variations of 
the equivalent noise spectrum with respect to normalized 
phase error. 

The interest here is in the loop response and perfor- 
mance of all-digital DTTLs, where digital symbol detec- 
tion and digital midphase accumulation are used. There 
are two sampling scenarios: one is to sample the signal 
instantaneously, and the other is to obtain the sample by 
I&D sampling of the signal. The instantaneous sampling 
technique can be used when the sampling rate is signifi- 
cantly higher than the symbol rate. The I&D sampling 
technique should be used when the number of samples per 
symbol is small. If the received symbol waveform is a per- 
fect square wave, the samples by instantaneous sampling 
all have equal amplitude. The samples by I&D sampling 
also have equal amplitude, except for the first sample of ev- 
ery symbol which has a different polarity from the symbol 
immediately preceding it. These changes in symbol polar- 
ity are referred to as the transition boundaries. The first 
sample after each transition boundary has a smaller am- 
plitude than other samples due to integration across the 
transition boundary. The all-digital DTTL can operate 
on either type of sample. It is worthwhile noting at this 
time that when the received signal is filtered and instanta- 
neously sampled, the process can be modeled to the first 
degree by I&D sampling of an ideal waveform. Thus, I&D 
sampling can be thought of as a tool to model the filtering 
operations in the receiver. 

The components in the all-digital DTTL affected by 
the type of sampling are the symbol detector and the mid- 
phase accumulator. For instantaneous sampling, the sym- 
bol detector accumulates all samples in the current symbol 
epoch. The midphase detector accumulates all samples in 
the current transition-detection window. For I&D sam- 
pling, the problem is slightly different. In this case, even 
when a sample is in the current symbol epoch, most of its 
energy may be from the previous symbol epoch. Therefore, 
a more sophisticated rule is needed to determine whether 
a particular sample should be used for the detection of a 
particular symbol. 

A reasonable criterion is to include a particular sample 
in the current symbol if more than half of its energy is from 


the current symbol. This criterion leads to a simple rule 
for the operation of the symbol and midphase detectors. 
The rule is as follows: the first sample after each symbol 
boundary should belong to the previous symbol if the time 
offset between the sample and the symbol boundary is less 
than half of the sampling interval; otherwise, the sample 
belongs to the current symbol. A sample mark is one-half 
of a sampling interval ahead of its respective sample time 
for I&D sampling, and is the respective sample time for in- 
stantaneous sampling. Thus, the rule can be restated: the 
symbol detector accumulates all samples with their sam- 
ple marks in the current symbol epoch, and the midphase 
detector accumulates all samples with their sample marks 
in the current transition-detection window. Therefore, the 
DTTL with I&D sampling is similar to that with instan- 
taneous sampling if the concept of a sample mark is used. 

To simplify the mathematical analysis, the effects 
of unequal I&D sample amplitude immediately following 
transition boundaries are ignored, and instead equal am- 
plitude for all I&D samples is assumed. 

To illustrate the differences between analog and all- 
digital DTTLs, the noiseless case is considered first. Note 
that if the input is an analog signal, the mid phase inte- 
grator can produce a nonzero error voltage no matter how 
small the phase error is. Thus, a correction voltage can 
be generated at every symbol transition whenever a phase 
error exists. Therefore, the analog DTTL has infinite res- 
olution for phase detection. 

In contrast, the all-digital DTTL has only finite res- 
olution for phase detection. This is illustrated in the fol- 
lowing example. Suppose that there is an even number 
of samples per symbol. When a symbol transition occurs, 
the digital midphase accumulator can produce a nonzero 
voltage only if the phase error causes sample slipping (as- 
suming samples of equal amplitude). As long as the phase 
error stays within a range of values that avoids sample 
slipping, the loop always generates a no-error signal. This 
range of undetectable phase errors accounts for the finite 
resolution of the all-digital DTTL. The more samples per 
symbol that are used, the higher the achievable resolution, 
and the closer the all-digital DTTL is to its analog coun- 
terpart. A key question is the impact of the all-digital 
DTTL’s finite resolution on the phase-error variance for 
few samples per symbol (say, four or five samples). 

Another issue in an all-digital implementation is the 
effect of a noninteger number of samples per symbol. If 
the sampling clock is driven by the symbol-synchronization 
loop, the number of samples per symbol can be made an 
exact even integer, which reduces the self-noise generated 
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in the midphase accumulator (as discussed later). Under 
that sampling scenario, the sampling clock is constantly 
adapting as the data rate changes due to Doppler or other 
effects. One disadvantage of that scheme is that no fixed 
time base is available in the system. On the other hand, 
if the sampling clock is free-running and is derived from 
a fixed frequency standard, the sampling period is fixed, 
although the symbol rate may change. This may result in 
a noninteger number of samples per symbol. A model is 
derived in this article to analyze the performance of the 
DTTL where the sampling rate and the data rate are non- 
commensurate. Other issues such as the mean time to lose 
lock (MTLL), probability of symbol error, probability of 
losing lock, and error variance are also investigated via 
simulations. In Section II, a general analysis of the loop is 
presented in handling several scenarios. A discussion and 
comparison with simulation results are given in Section III, 
and the conclusion is given in Section IV. 

II. Analysis 

The performance of the all-digital data-transition 
tracking loop with noncoherent sampling is analyzed here. 
The block diagram of the all-digital DTTL is delineated 
in Fig. 1. The input r(f) to the DTTL can be obtained 
by instantaneous sampling or by IfcD sampling. For the 
Advanced Receiver II, the number of samples per symbol 
becomes small at high data rates, and therefore the I&D 
sampling technique is used. In the subsequent derivation, 
equal- amplitude samples are assumed. 

Noncoherent sampling means that the sampling clock 
runs independently of the estimated symbol phase, i.e., 
the sampling time interval and the sampling time do not 
change with the estimated symbol phase. This is not an 
issue if there are many samples per symbol. The problem 
becomes complex as the number of samples per symbol 
decreases. The proposed Advanced Receiver II has about 
three to four samples per symbol at high data rates (the 
goal is 6.6 Msymbols per sec). Noncoherent sampling re- 
sults in a noninteger number of samples per symbol. All 
of these factors affect the performance of the DTTL by 
changing its S-curve and by introducing self-noise. Con- 
sidered here are the probability of loss of lock, the MTLL, 
the degradation of the symbol detection, and the phase- 
error variance. An approximate theory is presented for a 
first-order DTTL. The approach is to derive the S-curve 
and then solve the Fokker-Planck equation to get the den- 
sity function of the phase error. The phase-error variance 
and the degradation of the symbol detection can be eval- 
uated from the phase-error density function. 

To illustrate the phenomenon of self-noise, a simple 
example is shown in Fig. 2, where there are five samples per 


symbol. Assumed are no thermal noise and perfect track- 
ing at a particular moment. The output of the symbol- 
transition detector is not zero because it sums three sam- 
ples from the first symbol and two samples from the second 
symbol (Fig. 2a). Notice that this situation occurs for ev- 
ery symbol interval as long as the loop maintains perfect 
tracking. The nonzero output of the loop filter will gradu- 
ally drag the loop away from the perfect tracking condition 
until the polarity of the output of the symbol-transition de- 
tector changes (Fig. 2b). The loop filter cannot eliminate 
this type of self-noise. The problem is more complex if the 
number of samples per symbol is not an integer. In order 
to describe this phenomenon, three useful parameters are 
introduced here. Let /? denote the number of samples per 
symbol, which may not be an integer, and a denote the 
offset of the first sample mark in a received symbol from 
the symbol boundary. By convention, a is normalized and 
is measured as a percentage of the sampling interval. If /? 
is an integer, a remains constant; if (3 is not an integer, 
a varies from symbol to symbol. Let the received symbol 
be numbered 0, 1, 2, . . and let the value of a at the first 
symbol be denoted as c*o = 7, which is referred to as the 
initial sampling offset. The values of a at the subsequent 
symbols, namely, c*i, a *, . . can be computed from /? and 
7. The number of sample marks in a transition-detection 
window and the number of sample marks in a symbol- 
detection window are functions of a. Thus the output of 
the symbol detector and that of the transition detector 
fluctuate from symbol to symbol as a,. This subject will 
be further discussed later. 

Another important observation about the DTTL with 
noncoherent sampling is that there is inherent phase error 
due to finite samples per symbol. To illustrate this phe- 
nomenon, consider the example shown in Fig. 3, where 
every symbol contains four samples. As long as the es- 
timated phase lies between t\ and t 2r the error signal is 
always zero (or nearly zero if the received symbol does not 
have a perfect square waveform or if there are unequal- 
strength samples from I&D sampling), and the DTTL re- 
mains in tracking. However, unresolved phase ambiguity 
still exists within the interval from t\ to t 2 . Mathemati- 
cally, this phase ambiguity can be explained by a step-like 
S-curve. This phenomenon might have little effect on sym- 
bol detection performance if straight accumulation is used 
to detect the symbols. However, if weighted accumulation 
is used to detect the symbols, the phase ambiguity can in- 
troduce misweighting and thus degrade performance. The 
symbol-error probability can be obtained by simulation. 

Before proceeding to the mathematical analysis, ex- 
amine the all-digital DTTL block diagram again in Fig. 1. 
The error-signal accumulator between the loop filter F(z) 
and the multiplier performs an averaging function so that 
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the subsequent loop filter can operate at a slower speed. 
The loop bandwidth is determined primarily by the loop 
filter F(z ). Thus the presence of the accumulator is sim- 
ply for hardware convenience. In the following analysis, 
the DTTL is considered without the error-signal accumu- 
lator. 


A. Mathematical Model 

Assuming that the carrier and subcarrier (if any) have 
been removed in an ideal fashion, the received waveform is 
given by 

r(t) = VS ^ a* p(i - kT ) + n(t) 

k 

where S is the data power, n(f) is white Gaussian noise 
with two-sided power spectral density Nq/2 W/Hz, a* = 
±1 represents the polarity of the kth symbol, and p(t ) is 
the square- wave function having value 1 for 0 < t < T 
and having value 0 elsewhere. With IfcD sampling, the 
zth sample can be expressed as 

r(i) = VSa k + n(i) (1) 

where it is assumed that the ith sample is derived from the 
jfcth symbol, n(t) is a zero-mean Gaussian random variable 
with variance <r 2 = iV 0 /(2T,), and T s is the sampling inter- 
val. Note that equal sample strength is assumed in Eq. (1). 


Let the phase error A (in cycles) be defined as 



where 9 is the actual received symbol phase and 6 is the 
estimated symbol phase. Note that A should have a value 
between —0.5 and 0.5. The error signal is affected by the 
locations of samples within their respective received sym- 
bols. In order to quantify this effect, a set of twelve A 
functions is defined, six for the A > 0 case and six for 
the A < 0 case. They are the numbers of sample marks 
contained in their respective intervals defined in Fig. 4(a). 
The output of the inphase accumulator x(k) and the out- 
put of the midphase accumulator y(k) can be expressed in 
terms of the A functions. If A > 0, then 

x(fc) = y/S(A\ak + A 2 afc+i) + ni(k) + n 2 (fc) + *13 (k) 
x(k -f 1) = \/5(A3ajb+i + A4ajt+ 2 ) + n 4 (i) + n$(k) + ne(k) 

and 

y(k) = VS(A^ak + Aeajt+i) 4- (k) + 713 (k) 4- n 4 (fc) 

( 2 ) 

where 7 ij(k) } 1 < j < 6 are zero-mean Gaussian ran- 
dom variables with their respective variances (Ax - A 5 )<r 2 , 
A 5 <t 2 , A 2 <t 2 , (A 6 - A 2 )cr 2 , (A 3 4- A 2 - A 6 )<r 2 , and A 4 <r 2 , 

The A functions are computed using the following 
equations: 


Ax = \fi- ot\ - |A/? - <*J 


A 2 =l(l + \)p-a\-[p-a\ 
A3 = \2(3 — aj — [(1 + A)/? — aj 
A 4 = [(2 4- A)/? - aj - [2/3 - aj 


A 5 = 


' l/?-oJ - L(1 + A - (W/2))/? - crj 
0 


if W/2 > 1 4- A 
if W/2 < 1 + A 


f |_(1 4- A 4- ( W/2))/? — aj — \J3 — aj if W/2 > 1 + A 

[ L(1 + A + (W/2))/? - aj - L(1 + A - (W/2))/? -a] if W/2 < 1 + A 


( 3 ) 
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if A > 0, and 


Ai — [(1 + A )fi — Qfj + 1 


Aa = 


L— A/? + aj 

L-A/J + aj +1 


if — \p + a is not an integer 
if —A (3 a is an integer 


A3 — [_(2 + A)/? — oc] — \fi — a j 
A;= 10-cx\ -L(l + A)/?-aJ 




L/?-o| - 1(1 + A - (VT/2))/? - aj if W/2 > -A 

L(1 + A + (W/2))/? - aj - |_(1 + A - (W/2))/? - aj if W/2 < -A 


f [ (1 + A + (W/2))/3 - aj - IP - aj if W/2 > -A 
[ 0 if W/2 < -A 


(4) 


if A < 0, where [y\ is the greatest integer strictly less than 
V • 

In the above equations, W is the width of the 
transition-detection window. The derivations of the twelve 
A functions are similar. Two examples are given here, A2 
and A3. To derive A 2) the beginning of the Arth received 
symbol is used as the reference point. The number of sam- 
ple marks in the k th received symbol is \J3 — ct \ . The 
number of sample marks from the beginning of the kt h re- 
ceived symbol to the end of the k th estimated symbol is 
[(1 + A)/? — aj . Equation (3) follows by observing that the 
number of sample marks from the end of the kth received 
symbol to the end of the kth estimated symbol is A 2 . To 
derive A3, the beginning of the Arth received symbol is also 
used as the reference point. The number of sample marks 
in the kth received symbol is The number of sam- 

ple marks from the beginning of the kth received symbol to 


the end of the (A-fl)th estimated symbol is [(2 + A)/? — aj . 
Equation (4) follows by observing that the number of sam- 
ple marks from the end of the kth received symbol to the 
end of the ( k -f l)th estimated symbol is A3. 

The error signal e(k) is given by 

e(k) = z(k)y(k) 

The conditional S-curve is defined by 

g(\\a) = E ntS {e(k)\\,a] 

where E nfS represents the conditional expectation on A 
with respect to the noise and the signal. Following sim- 
ilar mathematical manipulation as in [2], 
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4g{e(fc)|A_> 0, a} = ^( erf (^3 + A 4 )£,//?) + erf((A 3 - A 4 ) x /£ s /(/?(A 3 + A 4 ))) j 


As * — erf (^(A! + A 2 )£,//?) 


A 5 — Ag 
0 


erf ^(Ai — A 2 )v/£ , ,/(/?(A i 4- A 2 


Ag — A 2 
^7t/3(A 3 + AfiE t 


f f A 3 -h A 4 ^ / (A 3 - A 4 ) 2 

:|exp^ E s J -f exp — rrT5 


(A 3 + A 4)/? 


A 2 + A 5 

/ 5 ( A 1 + A 2 )£/, 


,{ exp (_^l±^i E ,) +„p(-i|i T |^£,)} 


where E s is the symbol energy- to- noise ratio, namely, 


Using the same approach yields a similar result for the A < 0 case: 


4£'{e(^lA_< 0,o} _ A 5 + 4 . S erf( v /(A , 3 + A 


At - At 


erf (Ai-A')Am-A')) 


- *± (erf (^(A; + A' 7 )EJ^ +erf((A; - A' 2 )yJ E,/(f3(A\ + A' 2 


A 4~ A 6 f f A 3 + A 4 p ^ 4 - exn / f _( A 3 ~ A 4) 2 

/jt/?(A£ + A’jE, 1 P *' ' ( A 3 + A i)/^ 


>3 T ^4 r 
~ £ ‘ 




A' B + Ai f / A'j + A '2 (a; - A ' 2 ) 2 f 

t,K ai + Ai)B.i exp (' /» •) P ' (A'. + AiW ' 
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Observe that #(A|a) is the (unconditional) S-curve if 
a is a constant. If a changes rapidly from symbol to sym- 
bol, the loop filter will smooth its effect on the error signal. 
Therefore, the S-curve is obtained by averaging the above 
equations over a certain set of values of a, which is de- 
termined by the initial sampling offset and the number 
of samples per symbol. This problem is addressed in the 
subsequent discussion. 

If j3 is an integer, the value of a remains constant 
from symbol to symbol, namely, a* = 7 for all i. For this 
case, the S-curve can be determined for a given 7. When 
7 is 0.5, the S-curve is centered; otherwise, the S-curve is 
biased slightly to one side. The phase error is certainly 
biased if the S-curve is biased. If /? is an odd integer, the 
error signal is not zero when the phase error is zero. This 
is a source of self-noise, as discussed before. 

Next, consider the effect of a noninteger number of 
samples per symbol on the S-curve. To illustrate the con- 
cept, assume that /? = 4.1. Suppose that the initial sam- 
pling offset 7 is 0.7. Clearly, 07 == 7 = 0.7, aq = 0.6, 
c*2 = 0.5, <*3 ~ 0.4, a 4 = 0.3, 05 = 0.2, as ~ 0.1, <*7 = 0, 
a 8 = 0.9, <29 = 0.8, a\o = 0.7, and so on. Consider a 
system with a normalized symbol rate of 1 Hz and a one- 
sided loop filter bandwidth of 0.05 Hz. The error-signal 
fluctuation due to variation of a is averaged by the loop 
filter in the same way as the fluctuation due to thermal 
noise. Therefore, the S-curve is obtained by averaging the 
error signal with respect to noise and all possible values of 
a. For the example given here, the set of values for a is 
{0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9}. Figure 5 shows 
the conditional S-curve for the /? = 4.1 case with a = 0.5. 
Figure 6 shows the unconditional S-curve for the /? = 4.1 
case after averaging over values of a belonging to the set 
{0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9}. There is a 
phase ambiguity area in the conditional S-curve in Fig. 5, 
i.e, zero error signal for nonzero phase error. The phase 
ambiguity is removed in the S-curve in Fig. 6 due to aver- 
aging over a. Notice the small bias of the S-curve due to 
the particular set of values for a. 

Another example is the /? = 4.11 case. The fractional 
part of /?, i.e., 0.11, can be decomposed into the two com- 
ponents 0.1 and 0.01. Both components contribute to the 
values of a,-. For instance, if e*o = 7 = 0.7, then 07 = 
0.59 = 0.7 — 0.1 — 0.01, and o 2 = 0.48 = 0.7-0.2-0.02, . . . 
and so on. The same loop bandwidth is assumed as be- 
fore. The variation of a due to the component 0.1 changes 
quickly relative to the loop bandwidth, and the variation 
due to the component 0.01 changes slowly relative to the 
loop bandwidth. The S-curve is obtained approximately 
by averaging the error signal over all possible values of a 


due to the fast component. The initial sampling offset will 
drift slowly from time to time due to the slow component. 
The slow component does not affect the instantaneous S- 
curve, but it does affect the S-curve gradually by changing 
the initial sampling offset. Therefore, the slow component 
does contribute to the overall phase-error density function. 
Note that the conditional density function for the phase er- 
ror can be obtained for any given initial sampling offset. 
The phase-error density function can then be derived by 
averaging the conditional density function over the initial 
sampling offset. The distribution function for the initial 
sampling offset can be assumed to be uniform between 0 
and 1 or can be determined by simulation. 

Partitioning the fractional part of (3 into slow and fast 
components can only be done approximately, and they are 
determined by the loop bandwidth. Let (3 = np + fpi 
+ fp, 2 } where np is the greatest integer less than or equal 
to /?, fp } 1 is the fast component, and fp >2 is the slow com- 
ponent. The choice of fp t 1 and fp }2 is made solely by expe- 
rience. In the following discussion, a criterion is provided 
for justifying the choice of fp t \. 

Let 7 be the initial sampling offset. The set of values, 
denoted by Dp n which {a*} can take on, can be deter- 
mined from 7 and (3 using the following procedure: sup- 
pose that fp ti contains k digits after the decimal point (for 
instance, if fp } \ = 0.15, then k = 2). The basic incremental 
unit gp is defined as 

g g = GC DClQ^.x, 10 k ) 

10 k 

where GCD(a,i) is the greatest common divisor between 
a and b. Then Dp n is given by 

Dp n - {7 + mg 0 \O <m< (1 /gp) - l} 

For instance, if 7 = 0.1 and fpp = 0.15, then gp — 0.05, 
and Dp >() A = {0.1 -f 0.05m|0 < m < 19}. In terms of Dp n , 
the S-curve is given by 

{g(^l a )} 

where the expectation is performed with respect to all val- 
ues of a in Dp t 7 , which are assumed equiprobable. Let the 
one-sided loop bandwidth be Bl and let the symbol rate 
be R . A valid choice for fp t 1 is to ensure that all values in 
Dp n can occur in a If Bl time interval, i.e., to satisfy the 
following equation: 

B L —Rgp (5) 
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B. The Density Function and Variance of 
Phase Error 

The steady-state Fokker-Planck Eq. (2) is given by 
^{a(A| 7 )P(A| 7 )} = \ ^{s(A| 7 )P(A| 7 )} (6) 


In the above equation, 

where 5(u>,A|7) is the spectrum of the noise n.\(<) defined 
as 


n x (t) - E ntSi 0 , cAfi>y {e(k)e(k + m)|A, 7 } - (p(A| 7 )) 
The solution to Eq. (6) is of the form 

f A 2A(y\y) - dB ^ 


P(A| 7 ) = Cexp 


/ 


S(y| 7 ) 


■dy 


( 7 ) 


In order to use the above equation, 5(0, A (7) must be 
found, which is a fairly complex task. In the subsequent 
derivation, it is assumed that 5(0, AI7) = WNqT/A. Thus 
Eq. (7) can be simplified to 


P(A| 7 ) = C exp 


2 E s 


L B L RW J 0 


Jo 


\dy 


( 8 ) 


The phase-error bias P{A|7} is given by 

/•1/2 

E{A| 7 } = / AP(A| T ) 

J- 1/2 


dX 


The mean-square phase noise cr 2 (A| 7 ) is given by 


^(A| T ) = P{A 2 |7 }-(P{A | T }) 2 


The phase ambiguity phenomenon is a direct result of 
Eq. (8). Note that if g(y\j) = 0 for —61 < y < e 2 , the 
phase error is uniformly distributed between — ei and e 2 
when E s approaches infinity. 


III. Discussion and Numerical Results 

Figure 7 shows the phase-error variance versus symbol 
signal-to-noise ratio (SNR) with an even integer number of 
samples per symbol. Notice that the phase-error variance 
approaches a limit as symbol SNR increases for the given 
number of samples per symbol. That limit of the phase- 
error variance is due to the phase ambiguity; thus it cannot 
be eliminated by increasing the symbol SNR. The phase 
ambiguity decreases as the number of samples per symbol 
increases. 


Note that the phase ambiguity phenomenon may have 
an effect on the performance of weighted symbol detec- 
tion. For illustration, Fig. 8 shows simulation results of 
the MTLL of the all-digital DTTL for various symbol 
SNRs and for 4 and 100 samples per symbol. The plot 
depicts normalized MTLL, which is the MTLL times the 
loop bandwidth. Notice that it usually takes a long time to 
simulate the MTLL performance; therefore, the loop oper- 
ation was purposely simulated at a very low loop SNR (on 
the order of 3 to 9 dB) to guarantee loss of lock within a 
“practical” time period. It is clear that the four samples 
per symbol case (/? = 4) loses lock more often than the 
f3 = 100 case. In the DSN, the symbol loop SNR is so high 
that the loop is expected to maintain lock over a whole 
track. It is still expected that the MTLL for the (3 = 4 
case will be less than for the (3 — 100 case, but both of 
these will be large enough that lock is maintained over a 
whole track. 

In a practical communication system, Doppler and 
Doppler rate are present due to the relative motion be- 
tween transmitter and receiver. The effect of the Earth 
Doppler rate on a symbol rate of 6.6 Msymbols/sec is about 
1 mHz/sec, which is enough to guarantee that the number 
of samples per symbol will not remain an exact integer 
for long. Consider a scenario designed for (3 — 4 samples 
per symbol, but due to Doppler rate, the actual number 
of samples per symbol is j3 — 4.0000001. In this scenario, 
the basic incremental unit is gp = 10“ 7 . When the DTTL 
is operating with a 1-mHz-loop bandwidth, the time con- 
stant of the loop is about 1000 sec or 6.6 Gsymbols at a 
symbol rate of 6.6 Msymbols/sec. Since the loop is effec- 
tively averaging over all those symbols, the effect of the 
10“ 7 basic incremental unit will be enough to smooth the 
composite S-curve as discussed earlier. This is because 
with a time constant of 1000 sec, RgpjBi, = 6.6 Gsym- 
bols xl0“ 7 = 660 1 (Eq. 5). This effectively smooths 

the S-curve so that the digital loop behaves like its equiv- 
alent analog counterpart. For a loop time constant of 
1 sec (1-Hz loop bandwidth), Rgp/Bi = 0.66. However, 
with a time constant of 0.2 sec (5-IIz loop bandwidth), 
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Rgp/B L ~ 0.132; therefore, self-noise might become con- 
siderable. But that case would still exhibit less self-noise 
than the exact four samples per symbol scenario. So over- 
all, the Doppler rate helps in reducing the self-noise. De- 
pending on the actual parameters, the self-noise degra- 
dation might become negligible. More simulations with 
Doppler rates are planned to verify this concept. 


IV. Conclusion 

The all-digital DTTL with coherent or noncoherent 
sampling is analyzed in this article. Two sampling schemes 


are considered, i.e., instantaneous sampling and I&D sam- 
pling. The theory presented here is valid for both sam- 
pling schemes. The effects of few samples per symbol and 
of noncommensurate sampling rates and symbol rates are 
addressed and analyzed. The phase ambiguity problem 
due to a small number of samples per symbol is illustrated, 
and it is shown that the phase ambiguity can be alleviated 
when there is a noninteger number of samples per symbol 
and the loop filter has appropriate bandwidth. A closed- 
form expression for the S-curve is derived for any number 
of samples per symbol. Finally, the interplay between the 
loop bandwidth and the number of samples per symbol in 
the reduction of self-noise is shown. 
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Fig. 1. Block diagram of the all-digital DTTL. 
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Fig. 2. Effect of odd number of samples per symbol: (a) local reference waveform Is ahead 
of the received waveform, (b) local reference waveform Is behind the received waveform. 
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Fig. 3. The phase ambiguity phenomenon. 
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The Advanced Receiver currently being developed uses a Costas digital loop 
to demodulate the subcarrier. Previous analyses of lock detector algorithms for 
Costas loops have ignored the effects of the inherent correlation between samples 
of the phase-error process . Accounting for this correlation is necessary to achieve 
the desired lock-detection probability for a given false-alarm rate. In this article , 
both analysis and simulations are used to quantify the effects of phase correlation 
on lock detection for the square-law and the absolute- value type detectors. Results 
are obtained which depict the lock-detection probability as a function of loop signal - 
to-noise ratio for a given false-alarm rate. The mathematical model and computer 
simulation show that the square-law detector experiences less degradation due to 
phase jitter than the absolute- value detector and that the degradation in detector 
signal- to-noise ratio is more pronounced for square-wave than for sine-wave signals. 


I. Introduction 

Costas loops are being used extensively in modern co- 
herent communication systems to track both subcarriers 
and suppressed carriers. In many applications, residual 
carriers are being replaced by suppressed carriers as the 
latter dedicates the total transmitted power to both car- 
rier tracking and symbol detection simultaneously. This 
has a clear advantage over residual carrier tracking, which 
requires a fraction of the total transmitted power delegated 
solely to that purpose, and hence, reduces the available 
power that can be used for symbol detection. The dis- 
advantages of Costas loops are that they require symbol 
synchronization and suffer from an additional loss factor 


typically referred to as squaring loss, which is highly depen- 
dent on symbol energy- to-noise ratio. Squaring loss is the 
result of forming the product of the inphase and quadra- 
ture signals to wipe out the data modulation, in order to 
obtain a feedback error signal that is only a function of 
the instantaneous phase error [1]. Another disadvantage of 
suppressed carrier tracking is the issue of false lock, which 
occurs either as a result of accumulated delay [2] or during 
acquisition with frequency uncertainty greater than one- 
half the symbol rate [3, 4]. The latter can be detected 
through a false-lock indicator, as described in [3]. 

Lock detection is an important part of a tracking 
loop’s operation and monitoring, as it provides an insight 
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into the tracking loop’s behavior in real time. Lock de- 
tection basically serves as a binary indicator of whether 
the loop is tracking the received signal or not, and during 
loop start-up it also indicates whether or not the loop has 
acquired the phase of the signal. There are mainly two 
kinds of lock detectors for Costas loops, the I 2 — Q 2 , or 
square-law detector, and the | / | — | Q |, or absolute- value 
detector [5]. Both have been analyzed in the past at high 
loop signal-to-noise ratio (SNR), which basically assumes 
zero phase-tracking error. At low loop SNR, the assump- 
tion of zero phase jitter becomes inadequate and results in 
system operating parameters that are different from their 
design counterparts. Thus, a new model is required which 
has to account for the phase jitter and the correlation be- 
tween samples of the phase-error process. The latter is es- 
sential in an accurate performance prediction analysis, in 
order to operate the system at the desired lock-detection 
probability for a given false-alarm rate. The analysis of 
the detectors, including the phase correlation and assum- 
ing either a sinusoidal or a square wave signal, is presented 
in Section IT. In the case of a square-wave, the analysis is 
general and includes any windowing operation as described 
in [6]. The discussion of the results and some simulated 
data are shown in Section III, followed by the conclusion 
in Section IV. 


II. Lock-Detection Analysis 

Suppressed carrier tracking for binary-phase shift 
keyed (BPSK) signals can be accomplished using a squar- 
ing loop or a Costas loop [7]. The squaring loop relies on 
wiping out the data by a squaring operation and tracking 
the resulting residual double-frequency component with a 
classical phase-locked loop. The squaring loop does not 
require symbol timing, but results in an additional noise 
term which becomes dominant at low SNR. On the other 
hand, the Costas loop implements a phase discriminator 
by forming the product of the inphase and quadrature sig- 
nals. That too results in some degradation, commonly 
referred to as the squaring loss. Depending on the de- 
sign, both loops can be implemented with identical per- 
formances for all practical purposes. The Costas loop has 
various derivatives, each approximating the maximum a 
posteriori (MAP) estimator at different SNR s [8]. For ex- 
ample, a hard-limiter can be included in the inphase arm 
to estimate the current symbol, and that results in less 
squaring loss at high SNRs. 


dump arm filters are digital accumulators. The IQ loop 
and square-law lock detector are depicted in Fig. 1 for the 
square-wave case, with the optional windowing operation 
on the quadrature channel. The analysis that follows will 
be applicable for both sinusoidal and square waves. The 
received waveform is digitized to produce the samples rj } 
which are subsequently digitally mixed with the inphase 
and quadrature references. The outputs of the mixers, 
running at the sampling rate, are accumulated to detect 
the received symbols. It is assumed that there are L sam- 
ples per symbol and that perfect symbol synchronization 
has been achieved. The accumulator outputs, now at the 
symbol rate, are multiplied together to wipe out the data 
and again accumulated to reduce the processing rate even 
further. The output, running at a new rate (referred to 
as the loop update rate), is the input to the digital loop 
filter which provides a frequency estimate to adjust the 
phase of the numerically controlled oscillator (NCO). The 
lock detector processes the arm accumulator outputs at 
the symbol rate, accumulates the result over M symbols, 
and provides a binary decision on the loop status. 

In the I and Q branches of Fig. 1, the signals accu- 
mulated over L samples during the k th symbol interval are 
given by 

xn — dkLy/Po^k + njk k = 1, . . . ,Af 
and (1) 

XQk = dkLs/Po Vk + nqk k — 1 , . . • , M 


where 


Wfc - (l- I «Jt l) , Uk = ^-<pk, I <t>k |< 7T 


Vk = < 


«Jfc 

sgn(<£ t )W 


\<f> k \< *W/2 ( 2 ) 

trW/2 <| <j>k |< tt(1 - W/2) 


[ 2sgn(^i) - u k 7r(l - W/2 ) <| <f> k \< * 


for a square-wave subcarrier and 


UJ k = COS <P k 


This article is concerned with the lock detection for 
the all-digital IQ loop, which is also a derivative of the 
Costas loop with integrate-and-dump arm filters. All- 
digital refers to the fact that the input waveform to the 
loop is a sequence of samples and that the integrate-and- 


Vk 


( sin^jt \ <j> k \< irW/2 

J sin (<f> k W) vW/ 2 <| <t> k |< ?r(l - W/2) 


( 3 ) 


l sin (j>k jt(1 - W/2) <| </> k \< n 
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for a sine-wave subcarrier, where Pd is the average data 
power, dk is the data value of the k th binary symbol (±1 
equally likely), <f>k is the subcarrier phase-estimation error 
(radians) at time k , W is the width of the window in the 
Q channel (i.e., the fraction of cycle of the reference signal 
which has nonzero value W < 1 [W = 1 means no window 
is used]), and ft/jfc, riQk are zero-mean white Gaussian noise 
samples. From Eq. (1), the mean values and the variances 
of x/k and XQk conditioned on <fi k and d k are given by 

A*/* = dkL\fPnu)k <r i = L<7n 

and (4) 

VQk = d^Ly/Po v k <Tq — WLa 2 

where a 2 = NoB n is the noise variance of a received sam- 
ple, No is the one-sided noise spectral density, and B n is 
the Nyquist bandwidth. The equations that follow are ap- 
plicable to both sine and square waves. 

A. Square-Law Detector 

The first algorithm considered detects the in-lock state 
by producing a signal that is proportional to the cosine 
of the phase error (in the case of a sinusoidal wave) and 
averaging it over several symbols before comparing it to a 
threshold r. Referring to Fig. 1, 

M 

y* < r > where yk - - x hk ( 5 ) 

fc=i 

An estimation of the performance of this lock detector re- 
quires the first and second moments of x 2 Jk and Xq k con- 
ditioned on <f> jb, which are readily obtainable: 


Z/fc - I* Ik + <4 ( 6a ) 

X Qk = VQk + Oq (6b) 

x ik = Vik + QfijkCr] + 3(7/ (7a) 

= P%k + + 3(7^ (7b) 


where m k , HQk, and <7^ are given by Eq. (4). Using 
Eqs. (6) and (7), the variances of x 2 Ik and Xg k become 
respectively 


and 

var(i^i) = 4 iiQ k cr 2 Q + 2a% (8b) 

The mean value of yk is obtained from Eqs. (4) and (6) in 
Eq. (5), namely 

ti yii = DP D ( U l-vl) + La 2 „(l-W) 

The variance of t/*, o 2 k , will be the sum of variances of xj k 
and XQ k1 and is obtained by using Eq. (4) in Eq. (8), i.e., 

< = 2L 2 ^ + Wvl) + 1 + W 2 

i *7 n 

The lock-detector signal z is the accumulation of Myk sam- 
ples, which are highly correlated due to the phase-error 
samples. The mean value and variance of z, \i z , and a 2 
are derived in the Appendix, where it is shown that 

fi z =M^L 2 P D d + Lal(l-W)) (9) 

and 

a\ = M 2 L A P 2 D {g - d 2 ) + AML 3 P D a 2 n {f + hW) 

+ 2ML 2 <t a (1 + W 2 ) (10) 

The parameters d } /, h ) and g depend on the waveform 
type and are given by ( b = Wn/2) 

— 

/=0-im + ^F) 

n / j \ 2 i AT 1 

s = i-£m+(-) w E «(*)<«*) 

v 7 k = 0 

where 


var ( x /fc) — (8a) 
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and 


for a square wave and 



for a square- wave sub carrier, and 
d = exp(— 2 <r£) 

/ = 0.5 (1 + exp(-2<r£)) 


0.5^1 + exp(— 2<r^)^ for k = 0 

d(ifc)=l 

k exp(-4<rpcosh^2<T^C(rt)^ k = 1, 2, . . . , M - 1 

for a sine wave; (7(7* ) is the correlation function of the 
phase-error process in the tracking loop, which is assumed 
to be of the form given by Eq. (A-9) [9], and the second- 
order joint probability density function of the correlated 
phase process <f> 2 } r*) is assumed as in Eq. (A-10). 

When the loop is in-lock and assuming high loop SNR, 
<f>k — ► 0 for all k. Hence w* — + 1 and v\ — ► 0, and the above 
mean value and variance of the detector’s signal simplify 
to 

fi z = m(l 2 P d + LctI{1-W)) 


2 ,,„2 4 LPd , l + W 2 

= 4ML*<r* — r + 

<J n Z 



M— 1 

X c(k)d(k) 


k = 0 


which are true for both square-wave and sine-wave sig- 
nals. Note that in the above equation the following rela- 
tion holds: 



for a sine- wave subcarrier. Note that Eqs. (11) and (12) 
specify the variance of the phase jitter, assuming the linear 
loop model. For example, at 15 dB of nominal loop SNR, 
the actual variance can be about 1 dB larger. Nominal 
loop SNR p is defined as 1 /<r^, where < 7 ^ is obtained from 
the linear model; B sc is the one-sided noise bandwidth of 
the Costas loop, and E s /Nq is the symbol energy-to-noise 
ratio. The constants c(k) and d(k) are defined as follows: 

M for k = 0 

2(M — k) for k = 1, 2, . . . , M — 1 

for both waveforms, but 


because L — 2 T 9 B n (Nyquist sampling), where T s is the 
symbol time, — NoB n , and T 3 Pd = E Sy the energy per 
symbol. 


B. Absolute-Value Detector 

For the absolute- value detector, the squaring opera- 
tion in Fig. 1 is replaced by an absolute-value operation, 
and the algorithm defining the new lock detector becomes 


M 

Y2 V k < T ' where Vk = 1 x Ik | - | %Qk | (13) 

Jt = l 



<t > 2 | P(^l,^2) T 'jb)#l#2 


In order to estimate the performance of this lock detector, 
the first and second moments of | xjk | and | xq* | are 
needed, again assuming a white Gaussian noise process at 
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the phase-locked loop input. These moments, conditioned 
on <£*;, become 

I z/fc | = r lk 



The second moments of | xj k | and | xq k | are identical 
to the second moments of x and xq k} and are given by 
Eq. (6). The mean value of y k follows from Eqs. (13) and 
(14): 

A 

Vyk = r Ik ~ r Qk 



(15) 


and the variance of y k is 

CTyfc = L 2 Pd (wj t + vl) + Lcrl{ 1 + W) - (rTT 2 + rQk 2 ) 

The lock detector’s signal 2 is again obtained by adding 
My k samples. The mean and variance of z are found by 
averaging the first two moments of z over the correlated 


phase process in the tracking loop. This is carried out in 
the Appendix and gives 



(16) 


a] = m[l 2 P d (w 2 + t/ 2 ) + I<x 2 (l + HQ) 

+ + r Q> r Qi ) ~ M 2 (rj 2 + 7q 2 ) 

all i,j 
i*3 

(17) 

The bar over the product terms denotes expectation over 
the joint probability density function of <f>j , assumed 
to be of the form given by Eq. (A-10), and r Ik} rq k are 
defined by Eq. (14). Because no closed-form solutions for 
the above averaging operations are known, the averaging 
was done numerically. When the loop is in-lock and at 
high loop SNR, <j>h — < ► 0 for all k. Hence, — ► 1 and 
v l Oj and the above mean value and variance of the 
lock detector simplify to 
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C. Probability of Detection and of 
False Indication 

During subcarrier detection, each z sample is com- 
pared with a predefined threshold r, and the lock detector 
decides that the loop is in-lock when z > r. It is possible 
that even when no signal is present., z will occasionally be 
larger than r. In this case, the lock detector will mistak- 
enly declare an in-lock condition. The probability of false 
indication is 



where /i* o and cr^g are the mean and variance of the lock- 
detector signal in the out-of-lock state, and erfc(x) is the 
complementary error function (erfc(x) = 1 — erf(x), where 
erf(x) is the error function defined in the Appendix). For 
the square-law detector, fi z o and (j^ Q are obtained from 
Eqs. (9) and (10) by making P D = 0 (or, equivalently, 
assuming that u and vj i n Eq. 4 are zero), namely 

ti z0 = ML* 2 n (l-W) 
a\ 0 = 2ML 2 ^(1 + W 2 ) 

whereas for the absolute- value detector, Eqs. (18) and (19) 
result in 



Given a desired probability of false indication Pj , the 
threshold r is obtained by solving Eq. (20) and setting 
it equal to 

r = \j2crl 0 erfc” 1 (2 Pj) -f /i z0 ( 22 ) 

where erfc -1 ^) is the inverse complementary error func- 
tion. When the loop is in-lock, it can be argued via the 
central limit theorem that the random variable z is ap- 
proximately Gaussian, with mean and variance obtained 
earlier for either the square-law or the absolute- value de- 


tector. For either detector, the probability of detection 
is 



where \i z and <j\ are given by Eqs. (9) and (10) or by 
Eqs. (16) and (17). Defining the detector’s SNR as 

SNR, i 4 ( 23 ) 

then, for fi z o = 0 (W = 1), the probability of detection in 
terms of SNR* can be expressed as 



The above equation shows the dependence of the proba- 
bility of detection on the detector’s SNR. Phase jitter in 
the tracking loop degrades the detector’s SNR by a factor 
D: 

D = SNR, /SNR, (,'deai) (24) 

where SNR, (ideal) is the detector SNR, assuming infinite 
loop SNR, i.e., no phase jitter. SNR, (ideal) is computed 
from Eq. (23) using the high-SNR expressions in Eqs. (18) 
and (19) for p z and v\. For a given W, M, loop SNR p 
( p = l/<r^ where er^ is given by Eq. 11 or 12 repectively), 
and Pj, the detector’s SNR must be increased approxi- 
mately by the factor 1 /D in order to achieve a desired 
probability of detection. 


III. Discussion and Numerical Results 

Computer simulation was performed in order to check 
the predictions of the analysis. Figure 2 depicts the prob- 
ability of lock detection versus symbol energy-to-noise ra- 
tio E s /No for both sine-wave and square-wave signals, as- 
suming ideal conditions, i.e., no phase jitter in the track- 
ing loop. The square-law detector performs slightly better 
than the absolute-value detector for a given symbol SNR. 

The degradation in detection probability for a finite- 
loop SNR is shown in Figs. 3(a) and 3(b) for the square-law 
and absolute- value detectors, respectively. The threshold 
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r was set to achieve probabilities of false detection Pj of 
10“ 1 and 10 -4 , and detector SNR was set to achieve nomi- 
nal probabilities of detection Pd of 0.99 and 0.90. Nominal 
probability refers to the case of no phase jitter in the loop. 
It is clear that sine waves produce less degradation than 
square waves, and this is true for both detection schemes. 

The performance of both detectors is compared in 
Fig. 4 for square- wave signals only, since the difference 
in performance is almost negligible for sine- wave signals. 
The performance with respect to detector SNR is shown 
in Fig. 5 for a 15-dB loop SNR. The improvement in de- 
tection probability due to windowing is clear for both de- 
tectors and can result in several decibels. Finally, Fig. 6 
depicts both theoretical and simulation points of detec- 
tor SNR degradation D (defined in Eq. 24) versus loop 
SNR. The degradation in SNR is slightly larger for the 
absolute-value detector than for the square-law detector 
when tracking a square wave, but less when tracking a 
sine wave, and it can be as large as 3 dB depending on the 
operating parameters. 

The results are summarized in Figs. 7 and 8. The 
detector SNR as a function of E s /Nq is shown in Fig. 7 
for both infinite and 15-dB loop SNR, and for W = 1.0 
and 0.25. When W = 1, a good rule of thumb is that 
the detector SNR varies linearly with E s /Nq , with slope 
equal to 2/3 on a decibel scale. For different values of A/, 
the curve will be scaled vertically in a linear fashion. Fig- 
ure 8 depicts the detection probability for the square-law 
and absolute-value detectors respectively, as a function of 
E s /Nq for both infinite and 15-dB loop SNR. The con- 
clusion from Fig. 8 is that when operating at low loop 
SNR (i.e., 15 dB), an extra 1.5-dB increase in E 3 /Nq or a 
comparable increase in M will achieve the detection prob- 
ability which was designed for assuming infinite loop SNR. 

For design purposes, Fig. 9 can be useful since it 
depicts both the detection probability and the required 


threshold as a function of M for both detectors. As a 
design example, suppose that the absolute-value detector 
is required to operate at Pj = 1CT 4 and Pd ~ 0.99, and 
that the signal is a square wave with symbol rate r s — 80 
symbols per second, E s /Nq = 0.0 dB, and loop SNR = 
15 dB with a quarter window ( W = 0.25). Figure 9 indi- 
cates that at least 90 detector samples ( ) are needed to 
achieve 0.99 probability of detection. 

Setting M — 100 (integration time = 1.25 sec), r 
is obtained using Eqs. (21) and (22), with Pj = 10 -4 . 
Figure 9 predicts that r should be set to 46, assuming 
that the outputs of the integrate- and- dump devices are 
scaled by the factor 1/^/2 <r^L. Using Fig. 7 (M = 30), 
one can check that when the detector is in-lock, SNR* 

13 4* 101og 10 (100/30) = 16.3 dB, where scaling was per- 
formed to extend the results of Fig. 7 for M = 100. 
This is confirmed in Fig. 5, which depicts Pd = 0.99 for 
SNR* « 16 dB. 


IV. Conclusion 

This article presents a mathematical model of the 
performance of two lock detectors for Costas loops: the 
square-law detector and the absolute-value detector. The 
model concentrates on the impact of phase jitter in the 
tracking loop on the performance of the lock detectors. 
Results of the analysis were verified by computer simula- 
tion and show that low loop SNRs result in a degradation 
in probability of lock detection, the amount of which is 
dependent on the scenario of interest. That decrease can 
be overcome by properly readjusting the design parame- 
ters. It was further shown that the square-law detector 
experiences less degradation due to phase jitter than the 
abolute- value detector, and that the degradation in de- 
tector signal-to-noise ratio is more pronounced for square- 
wave than for sine-wave signals. 
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LOCK DECISION 


Fig. 1. Costas loop with the square-law detector. 



Fig. 2. Probability of detection versus E s /N 0 , infinite loop SNR, sine wave and square wave. 
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Fig. 4. Probability of detection versus loop SNR, square wave. 



Fig. 5. Probability of detection versus detector's SNR. 










PROBABILITY OF DETECTION, P 




Appendix 

Derivation of Detector’s First Two Moments at Low Loop SNR 


A. Square-Law Detector 

Using Eq. (5) with Eq. (4), the expression for lock- 
detector signal is rewritten as follows 

Vk = L 2 Pd (u>1 - vl) + (nj k - n 2 qk ) 

-\-2dkL\fPo (VkHlk — Vk^Qk) 

= Ofc 4* bk + cjc 
M 

z - ^2 (a* + 6* + Cfc) (A--!) 

*=i 

To assess the performance of the lock detector, the first two 
moments of z are needed. For a square-wave sub carrier, 
u>l — v 2 = 1 — 7 | <f)k |, and for a sine- wave subcarrier, 
a >1 — v% = cos 2^. Assuming that <j> is a zero-mean (no 
Doppler) Gaussian phase process, it can be shown that 


Eqs. (A-2) and (A-3) are independent of k. The expected 
value of z is 

H, = m(l 2 P d <1 + La 2 n (l - W)) (A-4) 

where d is the signal degradation factor due to phase jitter 
in the tracking loop, and cF = 0. Using Eq. (A-2) results 
in 

d=i ~ 2 (!) <r *( i “ exp (-si)) 

for a square- wave sub carrier, and 

d = exp^-2cr^ 

for a sine-wave subcarrier. To compute the variance of z, 
Eq. (A-l) is used to get 


I <t>k | = 




and 


(A-2) 


MM M M 

* 2 = a jr jr 9ig j + J2 J2 ( n “ - n Q > ) ( n h ~ 

: = 1 j=l i=l jzzl 


4>l = erf 



a^bex p 

7T 



(A-3) 


M M M 

+ B ^2, ( w * n /» — VkTlQi ) 2 T C ^ gi (n/j “ riQj) 
i= 1 i= 1 j = 1 


where 6 = cr£ is the variance of the phase-jitter process 
in the subcarrier loop, and erf (x) is the error function 
defined as 

erf (x) = -7= [ exp (— t 2 )dt 

Jo 

At first glance, Eq. (A-3) seems to express the variance 
of the phase-error process _as a nonlinear function of itself. 
This is not the case since <j>\ is the variance in the window 
of the loop, and a £ is the variance integrated over the 
complete density. This subtle effect is due to the Gaussian 
assumption, which approximates a density over a finite 
interval by the Gaussian density which is over an infinite 
interval. Numerically, <j>l and are very close for all 
practical values of loop SNR. Note that the moments of 


where A = B = AL 2 P Dy C = 2 L 2 P D) and 9i = 

(u) 2 — vf) = (1 — 7 | <j>i |) for a square-wave subcarrier. In 
the above equation, six terms were left out because their 
expected value is zero. Taking the expected value of z 2 , 
first over the thermal noise, and second over the phase 
process in the tracking loop, the following is obtained: 

l 2 = L A PlM 2 g + 2ML 2 cr A {\ + W 2 ) 

+ M 2 L 2 a A (l — W) 2 

+ AML 3 P D <r 2 (f + hW) 

-f- 2M 2 L 3 <t 2 (1 — W)d (A-5) 
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where 


M M 


= iEEwi 


*'= i i= i 


and /, h are given by 

(A-6) 

ft = v f = <F 

for a square-wave subcarrier and 

/ — u>l = cos 2 <f> = 0.5 (l + exp(— 2<r^ 

(A-7) 

A = sin 2 0 = 0.5(l - exp(-2<r£)) 


to evaluate g , one needs to know the second-order joint 
density function of the phase-error process p(^,^,r). As 
an approximation, it is assumed to be a two-dimensional 
Gaussian density specified by the means, variances, and 
correlation coefficient R{r ), which is obtained by simula- 
tion. Hence, 


p{<f>iAj, T ) ~ 


1 

27T v /fi 2 (0) - Wij) 


exp -■ 


r( o)H - + ^(Q)^ 2 

2(#(0)-#(r)) 


1 

~ C 2 (r) 


/ 0 2 - 2C{r)4>i4>j + 0 2 \ 
V 2<r 2 (l -C 2 (r)) ) 


(A-10) 


for a sine- wave subcarrier. The variance of z can be found 
from the relation <j\ = z 2 — (J) 2 . Using Eqs. (A-4) and 
(A-5) results in 

a 2 = M 2 L A P 2 D {g - d 2 ) + 4 ML 3 P D a 2 n (f + hW) 


Ilere g can be evaluated from the following: 

M M 1 b 


Tp / / 9,9jP(<t>i,<t>i, T ij)d<l>id<i>i (A-ll) 

■=1 i=i 4 4 


+ 2MLV*(1 + W 2 ) 


(A-8) 


r tj = T L (i - j) = t s - t } 


Note that at high loop SNR, g — ► 1, d — ► 1, / — ► 1, h — ► 0, Expanding Eq. (A-ll) for the square-wave subcarrier re- 

and Eqs. (A-4) and (A-8) reduce to suits in 


/i, = ML 2 P d + MLa 2 (l - W) 


t 2 = AML 2 at 


LP d 1 + W 2 



where 


as they should. In order to evaluate g , one must know the 
correlation between samples of the phase-error process in 
the tracking loop. That was obtained by simulation and is 
shown in Fig. A-l. A good closed-form model is given by 



I 01 11^2 I P(01,02,n-j)d0ld02 

(A-12) 


R(t) = a 2 C(r) (A-9a) 

where 

C(r) = (l- exp (-1.25 B L r) (A-9b) 

and where 0 ^ is the closed-loop variance of the phase pro- 
cess and Bi is the one-sided loop bandwidth. In order 


The correlation function in Eq. (A-10) is symmetric, i.e, 
R(rij) = R(Tji) } and depends only on the magnitude of 
the difference k =| i — j |. This property allows the double 
summation in Eq. (A-12) to be reduced to a single sum- 
mation: 

* = w s '(wm 

k = 0 
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where c(0) = M } c(k) — 2(A/ — k) for k — 1, 2, . . . , M — 1, 
and 

b b 

d(k) = // I <t>l II I p(<t>i,<j>2,T k )d<fi 1 d<t)2 

-b -b 


B. Absolute- Value Detector 

At low SNR in the tracking loop, the mean value of 
the detector’s signal z is obtained by taking the expected 
value of yk (Eq. 15) over the phase process in the tracking 
loop, and multiplying the result by M (the number of y * 
samples): 


Note that c (^) ~ M 2 as it should. For k — 0, the 

above probability density function (pdf) reduces to a delta 
function times a zero-mean Gaussian pdf with variance <7^, 
so that 


\4>\ II h | = <?l 


= M(r Ik - r Qk ) 



Because 




the first term of Eq. (A-8) can be rewritten as follows: 


^ = M'HPl ( 1) (s - \a\ 


It can be shown that the lower bound of the above equation 
equals 


^ = ML'Pl ( - ] (l--)tr; 


/2L / ( E, 2 \ r- ( E, v*' 

+ V V"" l eXp J - vW exp - 


where u and v are defined by Eq. (2) or Eq. (3), and 
r/fc and rq * are defined by Eq. (14). The variance of z is 
obtained from a 2 = z 2 — 7 2 , namely 


M M 


M M 


A - E E I XI > II x *i I + E E I x Qi II x Qi I 

*= 1 j = l i= 1 ; = 1 

M M / AT 

” 2 | xi j || xqj | — f | xjj | — | XQi 

i= 1 j' = l \t = l 

The following is now obtained: 


Keeping all detector parameters constant, a s the loop SNR 
decreases (i.e., increases), cr 2 , SNR*, Pj } and P h 
decrease. 


= M ( L 2 P d (u> 2 + u 2 ) + Lai (1 + W) 


For a sine-wave sub carrier, 


9 = 


1 

M 2 


M - 1 

E c w d w 


k = 0 


where c{k ) is the same as in the square-wave sub carrier 
case, and 


d(k) = { 


0.5(l +exp(-2<r 2 )^ 
ex P(— 4^) cosh (2alC(r k )j 


for k~ 0 
fc = l,2,.. 


M— 1 


+ EE ( r '> r/ J + r Qi r <3;) - M 2 (r/ 2 + r Q 2 ) 

all i,j 

W 


where again rj and rq are defined by Eq. (14). Unfortu- 
nately, closed-form solutions for most of the above equa- 
tions are not obtainable and their evaluation has to be 
done numerically. 

The double sum in the above equation equals 


1 

S = J2 c(k)d(k) 
k=l 
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where c(k) = 2 (M - k) for k = 1, 2, . . . , M — 1, and 


m(l 2 P d + L<r“(l + W) - (r7 2 +^Q 2 )) 



x p((^i,h,n)d<l>id4>2 M {l 2 P D {f + h) + La 2 (1 + W) - (rj 2 + rq 2 )) 

where the probability density function is again approxi- 

mated by Eq. (A-10). An upper bound of v 2 is where / and h are defined in Eqs. (A-6) and (A-7). 
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Tracking of interplanetary spacecraft equipped with optical communications 
systems by using astrometric instruments is being investigated by JPL. Existing 
instruments are designed to work at night and , for bright sources, are limited by 
tropospheric errors. To provide full coverage of the solar system , astrometric track- 
ing instruments must either be capable of daytime operation or be space-based. The 
integration times necessary for the ground-based daytime photon statistical errors 
to reach a given accuracy level (5 to 50 nanoradians) have been computed for an 
idea] astrometric instrument. The required photon statistical integration times are 
found to be shorter than the tropospheric integration times for the ideal detector. 
Since the astrometric accuracy need not be limited by photon statistics even under 
daytime conditions, it may be fruitful to investigate instruments for daytime optical 
tracking. 


I. Introduction 

Several observables for spacecraft navigation based on 
a laser telemetry system are being investigated. One such 
observable is the difference in direction between the space- 
craft and a cataloged reference object in the same field 
of view of an astrometric telescope. The error for such 
a ground-based differential astrometric measurement in- 
cludes the directional error of the reference source, the 
photon statistical error, and the error induced by index of 
refraction fluctuations in the troposphere. The best condi- 
tions for making differential astrometric measurements are 
at night, when the low background light levels lead to small 
statistical errors, and at high angles of elevation above the 


horizon, where the troposphere errors are smallest due to 
the short path length through the atmosphere. 

Unfortunately, on any given day only a small portion 
of the solar system can be observed at high elevations dur- 
ing the night. In Section II, the daily available observa- 
tion times for the planets are examined for the cases in 
which observations are restricted to nighttime and limited 
in elevation. All the planets suffer seasons in which no 
nighttime viewing is possible, corresponding to periods of 
low Sun-Earth-Probe (SEP) angles. While nighttime an- 
gular position measurements might augment other observ- 
able types during a fraction of a mission, critical parts of 
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a mission may take place at low SEP angles due to launch 
criteria. The restricted nighttime visibility implies that 
ground-based astrometric observations of laser-equipped 
spacecraft may have to cope with the daytime sky back- 
ground. 

In Section III, the photon statistical error of an as- 
trometric measurement in the presence of a background is 
analyzed. In Section IV, some specific examples of the pho- 
ton statistical error are presented. In Section V, a simple 
model of the tropospheric error is discussed. The relative 
size of these errors is affected by the angular separation 
of the reference source and the spacecraft. A larger sepa- 
ration is more likely to allow the use of brighter reference 
objects and result in a smaller photon statistical error. A 
small separation would require the utilization of fainter ref- 
erence objects but would decrease the tropospheric error. 
The smallest usable field of view is set by the density of 
the catalog stars. It was expected that in the mid 1990s 
the Hipparchos mission would provide a catalog with an 
average of 2.5 stars per 1-degree by 1-degree field with ini- 
tial accuracy of 10 nrad per component and proper motion 
uncertainty of 10 nrad/year [1]. The average brightness of 
reference sources in a 1-degree by 1-degree field is magni- 
tude m v = 8 [2], Using magnitude m v = 8 and 1 degree 
for the source-spacecraft separation, it is found that the 
integration times for daytime observations with 50 nrad 
accuracy are about 30 minutes. 

For the error models used here, the tropospheric error 
for such measurements is larger than the photon statistical 
error. However, the daytime photon signal-to-noise ratio 
is very unfavorable. Existing astrometric instruments, de- 
signed for nighttime operation, are not able to work in 
this regime. Limitations imposed by real detectors or by 
systematic effects such as sky brightness variation may be 
much larger than the photon statistical error. 


II. Nighttime Planetary Viewing 
Limitations 


Figures 1-7 are plots of the number of minutes per day 
that Venus, Mars, Jupiter, and Saturn are above 10 degrees 
or 30 degrees elevation with the sun below —15 degrees for 
the time span of 1990 to 2000. Mercury is almost never 
visible in the dark sky and is not plotted. Venus is al- 
ways at low elevation, and nighttime astrometric tracking 
is possible less than half of each year. The outer planets 
are unavailable for 25 percent or more of each year. These 
figures suggest that visibility limitations will severely af- 
fect the utility of astrometric tracking if it is limited to the 
nighttime hours. 


III. Photon Statistics of an 
Astrometric Telescope 

This section presents an estimate of the accuracy lim- 
itation placed by photon statistics on an astrometric tele- 
scope in the daytime. This category includes Ronchi-ruled 
telescopes and charge-coupled device (CCD) instruments. 
An ideal detector is able to record the position on the fo- 
cal plane for each detected photon. Any real instrument 
will have larger photon statistical errors than this ideal in- 
strument. CCD instruments are capable of dividing the 
image of a point source into many pixels and can approach 
the ideal detector scheme. However, limits on the size of 
CCD arrays limit the fields of view of such instruments. 
In a Ronchi telescope, a moving ruling is used to modulate 
the light incident on the detector with position informa- 
tion derived from the detected modulation [3]. For present 
designs, the field of view must be larger than the image of 
the star. This makes Ronchi telescopes more susceptible 
to background light problems than CCD instruments. 

The ideal astrometric telescope tracks a source for a 
time T and records the plane-of-sky coordinates (£, rj) = Q 
for each photon detected. The troposphere and instrument 
resolution cause the photons to be smeared in the plane of 
the sky. For simplicity, the photon spatial distribution 
about the true source direction is assumed to have the 
Gaussian form 


To demonstrate what fraction of a mission trajectory 
would be inaccessible to nighttime astrometric measure- 
ments, the number of minutes per day that each planet is 
visible from Goldstone in the night sky has been computed. 
Here “night” has been defined as the time that the sun is 
below —15 degrees elevation. This value for sun elevation 
was chosen to correspond roughly to the time of astro- 
nomical darkness. Since the astrometric error depends on 
elevation angle through the elevation dependence of the 
troposphere, two different minimum elevation cutoffs for 
the planets were included. 


ST 


I(Q - Q g ) = BT + j exp 


2cr 2 


(i) 


where B is the number of background photons per unit 
solid angle per unit time, Q g is the true source direction, 
and S is the number of signal photons per unit time. This 
distribution is a reasonable approximation to a smeared 
point source in a uniform background and has been suc- 
cessfully used in fitting high-precision small-field images 
[4]. The width of the Gaussian is determined by the ap- 
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parent diameter of a point source. This apparent diame- 
ter as determined by the turbulence of the atmosphere is 
commonly called the seeing angle. The full width at half 
maximum of the Gaussian (2.35<x) is here set equal to the 
seeing angle. 

Measurement of the source direction consists of 
recording the arrival of n photons and the plane-of-sky 
coordinates for each detected photon. The maximum- 
likelihood method can be used to estimate the source di- 
rection. The likelihood function L(Q a ) is the probability 
of recording n photons with the set of arrival directions 
{0*} given an assumed source direction fi a . 


where <r| is the variance of the source direction estimate 
in the £ direction, and the expectation value is the average 
taken over all possible numbers of detected photons and 
sets of directions. Since the assumed photon distribution 
is symmetric, the error is the same for the 77 direction. 
With some work, the error expression of Eq. (6) can be 
applied to the likelihood function given in Eq. (5) to give 
the error in the source direction measurement as 



where the function f(a) is given by 


The photons are distributed in time according to Pois- 
son statistics. The expected number of detected photons 
is given by 


/(«) = 



x 3 e~ x 

-U. dx 

a + e-* J /2 ax 


( 8 ) 


N = 



In the limit of no background (B = 0), a = 0, and /( 0) = 
1; the directional error is then given by 


where the integral is taken over the field of view of the 
telescope. The probability of detecting n photons is then 
given in [5] as 


P(n) 


N n e~ N 


(3) 


The (normalized) probability of any one photon arriving 
with direction fit given that the source is at direction Q a 
is 


P( n.io.) = m N Q ‘ ] (4) 

Assuming that the distribution for each photon- arrival di- 
rection is independent, the probability of finding n photons 
with the set of photon directions {ft*} is 



/(ft* - fi«) 

N 


(5) 


The maximum-likelihood estimate of the source di- 
rection is the assumed source direction that maximizes the 
likelihood function. In the limit N > 1 the error asso- 
ciated with this estimate is given by the Konig-Kramer 
bound [6] 


3 2 logL(ft a ) 

ea 


( 6 ) 




<x ( = 


a 

VST 


(9) 


Rewriting Eq. (9) gives the integration time to reach a 
specified accuracy < 7 ^ as 


T- 



( 10 ) 


In the limit of high background, the position error is given 
by 


cr /« 

= ~^/ST V ~ ~ 


( 11 ) 


Rewriting Eq. (11) gives 



( 12 ) 


The effective signal- to- noise ratio for determining the 
source direction is S/($n<r 2 B) where the combination 
8? ra 2 B is the effective background rate. 


IV. Examples of Photon Statistical 
Integration Umes 

In this section, the integration times needed to reach 
a given accuracy for a reference star and for a spacecraft 
at 10 AU with a 2-W laser are given. The parameters as- 
sumed for the detector, the spacecraft, and the background 
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are summarized in Table 1. The derived photon rates and 
integration times are listed in Table 2. 


The photon rate from a star of visual magnitude m v 
is given by [2, 7] as approximately 



where A is the central wavelength, h is Planck’s constant, 
c is the speed of light, AA is the wavelength pass band, 
d r is the telescope diameter, rj a is the atmosphere trans- 
mission factor, T) ra is the receiver obscuration factor, Tj ro is 
the receiver optics efficiency, r\j is the narrow-band filter 
efficiency, and rjd is the detector quantum efficiency. Since 
refraction will cause image smearing if the wave band is 
too wide, a value AA = 0.1 ^m, centered at A = 0.532 ^/m, 
will be used below. For the receiver efficiencies given in 
Table 1 and for a star of magnitude m,, — 8 the signal rate 
is 6.8 x 10 5 photons/second. 

The photon rate received from the spacecraft is given 
by [7] as 



X T ]ta T }to 7 ]tpVa r ]ra T ]ro (14) 

where P\ is the transmitted laser power, A is the wave- 
length, d t is the transmitter objective diameter, r is the 
spacecraft-receiver distance, r] ta is the transmitter obscu- 
ration factor, T] t0 is the transmitter optics efficiency, and 
7] t p is the transmitter pointing efficiency. The parameters 
used here are taken from an example by Kerr [8] and listed 
in Table 1. These factors combined with the nominal re- 
ceiver used above yield a detected photon rate S of 2.6 x 10 4 
photons/second for the spacecraft. 

A critical parameter for the tracking telescope is the 
atmospheric seeing. At many sites, the daytime seeing is 
worse than the nighttime seeing by a factor of 5 to 10 [9, 
10]. However, at some solar observatory sites, the seeing 
is 5 to 10 /irad (~1 to 2 arcseconds) both day and night. 
At the Sacramento Peak site, the daytime seeing angle is 
reported as being less than 10 /irad 80 percent of the time 
and better than 5 /irad 50 percent of the time during the 
day [11]. Since 10 /irad is similar toother reported daytime 
seeing values (see [11] and references therein), 10 /irad full 
width at half maximum will be taken as a nominal value. 
This corresponds to a value a — 4.3 /irad for the photon 
distribution. 


The background photon rate is given by 

(8 ttot 2 )5 = (8jtct 2 ) I (^) AA^ <f r T] ro T) ra j]jr] d (15) 

where I is the background spectral irradiance. The 
daytime background spectral irradiance depends on the 
weather, the sun elevation, and the Sun-Earth-Probe 
angle among other factors. The value for I of 
100 W/(/^m m 2 steradian) at 0.532 fi m [12] will be used 
in the following example. Kerr gives values ranging from 
30 W/(/xm m 2 steradian) at 90 degrees from the sun to 
500 W/fxm m 2 steradian at 10 degrees to the sun [8]. Us- 
ing the receiver values of Table 1 and 2-arcsec seeing, the 
daytime background rate is 2.6 x 10 9 photons/second. Us- 
ing a narrower filter will not improve the signal-to-noise 
ratio for the reference star since the signal photon rate 
depends on the bandwidth in the same way as the back- 
ground; narrowing the filter bandwidth reduces the signal 
rate and increases the needed integration time. However, 
the laser signal is narrow band and narrowing the filter 
bandwidth improves the spacecraft signal-to-noise ratio. 

Table 2 presents the integration times needed to reach 
50-nrad or 5-nrad accuracy for the reference star and the 
spacecraft. Several different filter bandwidth options have 
been used. For the case of the 0.03-nm filter, the filter 
transmission efficiency is reduced to 0.4 [8]. The inte- 
gration times for the photon statistics error to reach the 
5-nrad level are not prohibitively long provided that the 
spacecraft and star are separately filtered. However, sev- 
eral factors may combine to lengthen the integration times. 
The photon rates for the reference star plus background are 
very high for photon counting devices. Detector dead time 
and noise will be significant effects. A Ronchi telescope in- 
tegrates the background over a fixed field of view. Unless 
the field of view is limited to the size of the source image 
and precisely positioned, the photon statistics error will 
be much worse for a Ronchi telescope. In any case, the 
ruling modulation reduces the photon flux by a factor of 
2, correspondingly increasing the integration time. If each 
coordinate is measured separately, the times will double 
again. 

Effects other than counting problems are important. 
Since the integration time increases as the fourth power 
of the seeing, the site selection is critical. For the ref- 
erence star, the signal-to-noise ratio may be worse than 
10 -3 . This may pose unrealistic requirements on the dy- 
namic range and linearity of the detector. There is also 
the possibility that variations in the background intensity 
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over the period of integration could degrade the accuracy 
of the star position. 


V. Tropospheric Integration Times 

The photon statistics contribute an angular position 
error for the spacecraft and the reference object. The error 
in the angular difference between the spacecraft and refer- 
ence object is also affected by variations in angle of arrival 
imposed by the troposphere. The integration time require- 
ments imposed by the troposphere may be computed from 
a result by Lindegren [13]. Lindegren’s result is derived 
from a frozen turbulence model of the atmosphere with 
a power law structure function. The turbulence causes 
angle of arrival variation as the frozen turbulence moves 
with the wind velocity. The variations in angle of arrival 
from two point sources are computed, given some assump- 
tions about the atmosphere structure and averaging over 
wind direction. Lindegren’s paper compares his model to 
several experimental results, including stellar position and 
solar diameter measurements, with reasonable agreement. 

The expression for the error cr# in the difference angle 
0 between two sources is 

a e = 6.3 x lO -6 0*T-7 (16) 

where <r$ and 0 are given in radians and the integration 
time T in seconds. Solving Eq. (16) for the integration 
time necessary for a given angular accuracy uq gives 

T = 4.0 x Hr n 0i<7- 2 (17) 


This expression applies near zenith and with about 1- 
arcsec seeing. Results are expected to be worse for lower 
elevation angles and worse seeing. 

For two objects 1 degree apart, this implies an in- 
tegration time of 59 hours to reach 5-nrad accuracy and 
35 minutes to reach 50-nrad accuracy. These times are 
long compared to those imposed by the photon statistics 
for a magnitude 8 star. There is a possible trade-off by 
narrowing the field of view and utilizing fainter reference 
stars. However, this implies a larger catalog effort. By 
including more reference objects in the field, there is the 
possibility of reducing the troposphere error. 


VI. Discussion 

The photon statistics do not rule out the operation 
of astrometric telescopes for daytime optical tracking of 
spacecraft. Given the simple assumptions used in Sec- 
tion III, the tropospheric error dominates the photon sta- 
tistical error for a magnitude m v = 8 star and 1-degree 
source-spacecraft separation. However, existing instru- 
ments are not capable of operating in the daytime because 
of the extremely poor signal-to-noise ratio and large pho- 
ton fluxes. Finding a suitable detection scheme for day- 
time operation will be a challenge. Since the integration 
times required are many minutes, systematic effects and 
variations in the background level may wash out the posi- 
tion signal. It would be useful to try differential position 
measurement in the daytime at a solar observatory with 
a CCD camera to begin an investigation into systematic 
background effects. 
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Fig. 1. Plot of the number of minutes per day (24 hours) for 
which Venus is visible above 10 degrees in the nighttime sky. 



Fig. 2. Plot of the number of minutes per day (24 hours) for 
which Mars Is visible above 10 degrees in the nighttime sky. 
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Fig. 3. Plot of the number of minutes per day (24 hours) for 
which Mars Is visible above 30 degrees in the nighttime sky. 



Fig. 4. Plot of the number of minutes per day (24 hours) for 
which Jupiter is visible above 10 degrees in the nighttime sky. 
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Fig. 5. Plot of the number of minutes per day (24 hours) for 
which Jupiter is visible above 30 degrees in the nighttime sky. 
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Fig. 7. Plot of the number of minutes per day (24 hours) for 
which Saturn is visible above 30 degrees In the nighttime sky. 


Fig. 6. Plot of the number of minutes per day (24 hours) for 
which Saturn Is visible above 10 degrees in the nighttime sky. 
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The new Viterbi decoder for long constraint length codes , under development 
for the DSN , stores path information according to an algorithm called “traceback.” 
The details of a particular implementation of this a Igorithm, based on three memory 
buffers, are described . The penalties in increased storage requirement and longer 
decoding delay are offset by the reduced amount of data that needs to be exchanged 
between processors , in a parallel architecture decoder. 


I. Introduction 

A new, long constraint length Viterbi decoder [1] is 
under development for the DSN, and will be used to decode 
the constraint length K = 15 experimental code adopted 
by the Galileo mission. This article describes the traceback 
algorithm that is used in this decoder to store the most 
likely paths into each node of the decoder’s trellis. 

The traceback (TB) method is one of two known ways 
to store decisions made by the add-compare-select unit of a 
Viterbi decoder, and then provide decoded bits as output. 
The other method is the more traditional register exchange 
(RE) technique. The RE method is suitable for short con- 
straint length decoders or for low-speed decoders due to 
the large amount of data that needs to be read, modified, 
and rewritten at each bit time. 

The basic difference between these two methods is 
that the RE method stores the actual hypothesized infor- 
mation sequences (survivors), while the TB method stores 


the results of comparisons of paths converging into each 
node of the trellis. The maximum-likelihood path is then 
found by tracing back through the trellis a path, according 
to stored decisions. It is worth observing that the bits rep- 
resenting the results of these comparisons actually coincide 
with the information bits in convolutional codes where the 
state transitions are governed by a shift register. There- 
fore, the crucial difference between RE and TB methods 
lies in the organization of the memory used to store the 
survivors. The TB method is widely used in practice, but 
not as widely described in the literature. It is mentioned in 
[2], and described in [3] without any reference to required 
storage or resulting decoding delay. 

This article deals with the details of one version of the 
traceback algorithm that is based on three pointers explor- 
ing three memory buffers. Details on the hardware imple- 
mentation of this version of the algorithm may be found in 
[4]. If L branches are required as the minimum decoding 
depth ( L « 5A' is a usually accepted rule, but L « 10/i is 
more realistic for low Eb/No applications), enough storage 
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must be provided for 3 L branches in order to perform the 
necessary buffering for this TB method. This penalty is 
not important in practice since inexpensive and slow off- 
chip memory can be used, while the RE method requires 
fast on-chip registers. 


II. Memory Management for Traceback 

The memory required for the traceback method is or- 
ganized in three banks as shown in Fig. 1. Each bank is L 
bits long and 2 K ~ l bits high (the number of states), which 
gives a required storage of about 1 Mbyte, for K = 15 and 
L = 170 (not including additional storage for 2 K ~ l accu- 
mulated metrics). Any bit in this traceback memory can 
be accessed by an address consisting of the state j (0 < 
j < 2 k_1 ) and the bit memory pointer m (0 < m < 3 L). 

There are three basic operations going on in the mem- 
ory banks every bit time: 

(1) Traceback , which is a “read” operation and traces a 
path between states on the trellis by computing the 
next (backward) state address from the presently read 
memory content. 

(2) Decoding , which is also a “read” operation and similar 
to traceback, except that it is performed on “older” 
data and it produces output information bits corre- 
sponding to the path being traced. 

(3) Writing new data (decisions given by the add-com- 
pare-select unit), which moves forward on the trellis. 
These bits can be written in the locations just freed 
by the decoding operation. 

Every L bit times a new traceback front is started in 
one memory bank from state 0, and a new decode front 
is started in a different memory bank at the state where 
the previous traceback ended. New data is written into 
the second memory bank as memory locations are freed 
by the decode operation. The third memory bank re- 
mains inactive. After a period of L bit times, the trace- 
back/decode/write operations are switched to a different 
pair of memory banks, as described in detail below. 

Among the three operations, the writing of new data 
is by far the most time consuming, since 2 K ~ l bits must 
be written for each information bit time. Read operations 
only access one bit per information bit time. 

Given the amount of required memory, it is cheaper 
to use commercial RAM chips, rather than to design this 
memory into custom VLSI circuits. 


III. The Traceback Algorithm in Detail 

The evolution of memory operations needed for the 
traceback method is illustrated in Fig. 2, where the vertical 
axis represents the elapsed time and each box represents 
the status of memory at a given time. The variables’ names 
are the same later used in the pseudocode description of 
the algorithm in Fig. 4. 

At time = 0, a new traceback is started from state 
0 ( stateJb = 0) at bit memory pointer m — L — 1 in 
the rightmost memory bank (bank = 0). The top row of 
Fig. 2 shows the memory bank number where traceback 
( tb ) and decoding (dec) operate. This traceback will end 
in a certain stateJb at bit memory pointer m = 0. 

Simultaneously, a decoding operation starts from state 
state-dec (this is initially an arbitrary value) at bit memory 
pointer m = L— 1 and proceeds until m = 0, in the leftmost 
memory bank (bank 1). For each decoded bit, a full column 
of bits can be overwritten with new data. Notice that 
all three operations (traceback, decode, and write) evolve 
from right to left. 

At the end of the first block of L bits, a new traceback 
starts from state 0 ( stateJb = 0) at bit memory pointer 
m = 0, moving from left to right in memory bank 1. At 
the same time, a decoding operation goes on in the middle 
bank (bank 2), starting at the state where the previous 
traceback ended (state jdec = stateJb) and at bit memory 
pointer m = 0. Also, new data is written in the bank doing 
decoding. All of these operations evolve from left to right. 

After the end of the second block, a traceback and a 
decoding start at m = L — 1, both moving from right to 
left, in banks 2 and 0, respectively. New data is written in 
bank 0. After the third block, i.e., during the fourth block, 
traceback and decoding take place in banks 0 and 1 again, 
but all operations occur left-to-right, which is opposite to 
the direction for the first block. Notice that the read/write 
operations alternate between right-to-left and left-to-right 
sweeps, which implies that Fig. 2 has a repetition cycle of 
3x2 = 6 blocks. 

Only after three full blocks is decoding performed on 
data that has actually been written, rather than on initial- 
ized memory. Therefore the decoding delay is at least 3 L 
bits. Since the decoded output is generated in reversed or- 
der (last bit first in each block), one must provide a buffer 
to reverse the output, bringing the decoding delay to 4 L 
bits. Finally, the delay due to the encoder memory must 
be added, which yields a total decoding delay of AL + K 
bits. 
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The flow diagram of memory operations is shown in 
Fig. 3, where the traceback and decoding operations are 
shown as sequential in time, even though they may happen 
simultaneously in a specialized hardware. 

Figure 4 shows the pseudocode description of the TB 
algorithm for a convolutionally coded system using a 
(7,1/2) code and L = 100. The C-language version of this 
algorithm has been used to demonstrate this concept by 
software simulation and to verify the correctness of the al- 
gorithm. The memory is denoted by the three-dimensional 
array RA M [st ate][m] [6a nk], M[state][.] stores the accumu- 
lated metrics, and d[] represents the branch metrics. 


of Traceback for Parallel 

In a multiprocessor implementation, the Viterbi algo- 
rithm based on register exchange requires that the full sur- 
vivor sequences be exchanged among processors, together 
with the accumulated metrics. It has been found [5] that 
the traceback method drastically reduces the communi- 
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IV. Advantages 
Processing 


cation bandwidth required between processors by elimi- 
nating the need for survivor exchanges. This reduction is 
achieved at the price of higher decoding delay. 

Figure 5 shows a general parallel architecture, where 
the interconnection network is described in [6]. Since, 
among the three operations described in Section II, the 
write operation is the most demanding, it is performed con- 
currently in each processor and its local memory. Each pro- 
cessor operates sequentially on a certain number of states, 
and then exchanges the accumulated metrics through the 
interconnection network. The traceback/decoding opera- 
tion may use a bus line to transfer information, consisting 
of traceback memory addresses sent to all memories and 
single bits coming from a particular memory, correspond- 
ing to the memory location referenced by a given address. 
The new address is computed from the old one and the 
data bit read from a memory. The address computation 
does not require parallelism, since it uses only one bit read 
per information bit. At the end of each block the last ad- 
dress found by the traceback unit is used to initialize the 
decoding unit. Further details on the hardware implemen- 
tation may be found in [4]. 
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Fig. 1. Memory organization in three banks. 
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Fig. 3. Flow diagram of memory operations. 











constraint length 
number of states” 


K«7 

NS=2 A (K-l) 

N$2=NS/2 

L^lOO "truncation length" 

state_tb=0 

time=0 


WHILE time < max 

{ 

m-time MOD L 
bit_par=time MOD 2 

IF(m--O) 

{ 

blk_no=time/L 
blk_par=blk_no MOD 2 
tb=blk_no MOD 3 
dec= (tb+1) MOD 3 
state_dec=state_tb 
state_tb=0 

FOR m FROM 0 TO L-l { outr [m] =out [m] } 

} 

IF (blk_par=-0) {ml^L-m-l } 

ELSE {ml— m} 


"start a new traceback front" 


"bank doing traceback = 0,1,2" 

"bank doing decoding" 

"set starting state for decoding" 

"set starting state for new traceback 
"buffer for order reversal" 


"right to left" 
"left to right" 


state_dec= (SHIFT RIGHT state__dec OF 1 BIT) OR (NS2*RAM [ state_dec ] [t ] [dec] ) 

"decoding: address in bank=dec" 

state_tb- (SHIFT RIGHT state_tb OF 1 BIT ) OR (NS2*RAM [ state_tb ] [t] [tb ]) 

"traceback: address in bank=tb" 


out [m] = (SHIFT RIGHT state_dec OF K-2 BITS) AND 1 


"output buffer" 


FOR j FROM 0 TO NS-1 "add, compare, select" 

{ 

i=SHIFT RIGHT j OF 1 BIT 

LO = M [ i] [bit_par XOR 1] + d [ K [ j ] ] 

LI - M[i OR NS2] [bit_par XOR 1] + d[3-K[j]] 

IF (LI < LO) { "write one bit" 

M [ j ] [bit_par ] =L1 
RAM [ j ] [ml] [dec]=l 
} 

ELSE { 

M [ j ] [bit__par ] =L0 
RAM [ j ] [ml] [dec] =0 
} 

} 


PRINT outr[L-l-m] "print decoded bits in correct order" 

time “ time+1 

} 







ADDRESS BUS 
DATA BUS 


P = PROCESSOR 
M = MEMORY 


Fig. 5. Parallel traceback architecture. 

















TDA Progress Report 42-99 


N90- 19444 

V3/S 


November 15, 1989 

tit 


Some Easily Analyzable Convolutional Codes 

R. McEliece, S. Doiinar, and F. Pollara 

Communications Systems Research Section 

H. Van Tilborg 

Eindhoven University, Mathematics Department, The Netherlands 


Convolutional codes have played and will play a key role in the downlink teleme- 
try systems on many NASA deep-space probes , including Voyager , Magellan, and 
Galileo. One of the chief difficulties associated with the use of convolutional codes , 
however , is the notorious difficulty of analyzing them . Given a con volution a/ code 
as specified , say, by its generator polynomials, it is no easy matter to say how well 
that code will perform on a given noisy channel. The usual first step in such an 
analysis is to compute the code 7 s free distance; this can be done with an algorithm 
whose complexity is exponential in the code’s constraint length . The second step 
is often to calculate the transfer function in one, two, or three variables, or at least 
a few terms in its power series expansion. This step is quite hard, and for many 
codes of relatively short constraint length, it can be intractable. However, we have 
discovered a large class of convolutional codes, for which the free distance can be 
computed by inspection, and for which there is a closed-form expression for the 
three-variable transfer function. Although for large constraint lengths, these codes 
have relatively low rates, they are nevertheless interesting and potentially useful. 
Furthermore, the ideas developed here to analyze these specialized codes may well 
extend to a much larger class. 


I. Introduction 

In this article a class of binary (n, 1), constraint length 
K, convolutional codes, called zero-run length (ZRL) con- 
volutional codes , is defined and studied. These codes are 
interesting because they are easy to analyze. ZRL codes 


include as special cases orthogonal convolutional codes, the 
recent “superorthogonal codes” of Viterbi, and many oth- 
ers. None of the convolutional codes currently used in 
NASA missions belong to the ZRL class. For any ZRL 
code, it is possible to compute the free distance by inspec- 
tion, and to write down the complete transfer function 
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T(D,I,L), explicitly (see Theorem 7, below). Important return to this state diagram in the proof of our main result, 
variations of the transfer function, viz. Theorem 7, below. 


T num (D) = T(D, 1,1) 

T len (D) = 1,1) 

are commonly used to overbound the probability of de- 
coder error for these codes ([3], Section 9.3, or [4], Sec- 
tion 4.4). For arbitrary convolutional codes, these func- 
tions can be very complicated indeed (see [7]), but for any 
ZRL code these functions have simple, closed- form expres- 
sions (see Corollary 8). 


II. Zero-Run Length Convolutional 
Codes 

Any ( n , 1), constraint length K convolutional code is 
characterized by a list of n generator polynomials (</i(z), 

. . g„(x)), where gi(x) = gi,o +9i,iX-\ 1- gi,K-ix l< ~ 1 is 

a polynomial of degree K — 1 or less. The encoder for such 
a code consists of a shift register of length K — 1, with one 
input and n outputs; the state of the encoder is defined 
to be the contents of the shift register. If («i, . . . , sjr-i) 
is the current state, and $o is the current input, the next 
state is (so, . - . , sk - i ) and the output, which we will call 
a code segment , is the n-tuple (yi,...,y n )> where y, = 

E k-i 

j—o 


1. Definition. An encoder state s = (si«2 * * * S K- 1) 
is said to have zero-run length t, written “ZRL(s) = i” 
for short, if s contains exactly i leading zeros. For exam- 
ple, with I< = 5, ZRL(OOIO) = 2, ZRL(OOOO) = 4, and 
ZRL(lOOl) = 0. In general, for an (n, 1), constraint length 
K t convolutional code, there will be 2 K ~ l states, but only 
K possible values for ZRL (0, 1, . . . , I\ — 1). 

Note that if the encoder is in a state of zero-run length 
t, and the input is 0, the next state will have ZRL = 
min (i + 1, K — 1), whereas if the input is 1, the next state 
will have ZRL = 0. Thus the ZRL of the encoder’s next 
state depends only on the current value of ZRL and the 
input. This fact is illustrated in Fig. 1, which shows the 
topology of states, organized according to the values of 
ZRL. In Fig. 1, the arrows marked with a’s represent state 
transitions caused by 0 inputs, and the arrows marked with 
/?’s represent state transitions caused by 1 inputs. We will 


2. Definition. An (n, 1) convolutional code of con- 
straint length K is said to be a ZRL code if the output 
weight depends only on the input and the ZRL of the state. 
The symbol is used to denote the output weight if the 
encoder has ZRL = i and the input is 0, and the symbol 
Wi is used if ZRL = i and the input is 1. The u t ’s and the 
Wj’s are conveniently displayed in a 2 x I\ matrix, called 
the weight matrix of the code; 

0 1 •• K — l 

0 ( u 0 Ui * " uk- 1 

W = 

1 \ W 0 U) 1 • ■ ■ W K - 1 

3. Example. Let K = 3. Then the (4,1) con- 
volutional code with generator polynomial list (l,x,l + 
x 2 , 1 + x + x 2 ) is a ZRL code. Since with K = 3 there 
is only one state with ZRL = 1, viz. 01, and only one 
state with ZRL — 2, viz. 00, in order to verify that this 
code is ZRL, one need only investigate the two states with 
ZRL = 0, i.e., 10 and 11. If the state is 10 and the input 
is 0, the output is (0101), whereas if the input is 1 the 
output is (1110). On the other hand, if the state is 11 and 
the input is 0, the output is (0110), and if the input is 1, 
the output is (1101). Thus, if the state has ZRL = 0, and 
the input is 0, the output weight is 2; and if the input is 1, 
the output weight is 3. Hence, the output weight indeed 
depends only on the state’s ZRL, as required. The weight 
table for this code is as follows; 


0 1 2 
0/220 

W = 

1 yz 1 3 

4. Definition. The profile of an (n,l), constraint 
length K ZRL convolutional code is the vector (di,d 2 » ■ ■ ■ > 
c I k ), where is the Hamming weight of the output of the 

encoder, beginning in a state with ZRL ~ 0, with length i 
input sequence 0 l-i l. 


5. Lemma. In terms of the entries in the weight 
table, the profile of a ZRL convolutional code is 


di = uo + U\ + • * * + ti*_2 + Wi-l 


for % — 1,2, . . . , K 

Proof: If one starts in a state with ZRL = 0, and 
uses the input sequence 0*" 1 , one passes through states 
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with ZRL = 1, 2,..., i - 2, causing outputs of weight 
ti 0 , ui, . . . > u i-2» and arrives at a state with ZRL = * — 1. 
The last input of 1 causes the encoder to move to a state 
with ZRL = 0 and to produce an output of weight Wi-\. 

6. Example. Combining the weight table in Exam- 
ple 3 with Lemma 5, one finds that the profile of the code 
in Example 3 is (3,3,7): d\ = wq = 3; d >2 — uo = 
2 + 1 = 3; and d 3 = uq + ti i 4- 1^2 = 2 + 2 + 3 = 7. 


pair of vertices in the sequence is connected by a directed 
edge. For example, in Fig. 2, the vertex sequence 00 — ► 
10 — ► 01 — ► 00 is a path of length 3. A path is completely 
specified by its initial vertex and the string of input bits 
corresponding to the vertex transitions, which we call the 
input string of the path. For example, the path 00 — ► 10 — ► 
0 1 — ► 00 has initial vertex 00 and input string 100. The 
weight of a path is defined to be the product of the labels 
on its edges. For example, the path 00 — ► 10 — ► 01 — * 00 
in Fig. 2 has weight D w *+ U °+ U ' IL Z , 


111. Transfer Function for ZRL Codes 

The following theorem is our main result. It gives the 
promised closed-form expression for the transfer function 
of a ZRL code in terms of its profile. 

7, Theorem. For a ZRL convolutional code with 
profile (di , . . . , dx ) , the three- variable transfer function is 
given by 


T(D t I,L) 


D dK IL K 

K-l 

l- y Z ° diILi 

i= 1 


Proof: One begins by reviewing the definition of T(D, 
I } L ) for an arbitrary (n,l), constraint length I < , convolu- 
tional code. (See [3] or [4] for more details.) 

Starting with the state diagram for the given code, 
which is the 2 K ~ l vertex deBruijn graph, each of the 2 K 
edges is labelled with a monomial in the three indetermi- 
nates D , /, and L, i.e., a term of the form D w I € L. The 
power w of D in the monomial represents the Hamming 
weight of the encoder output corresponding to the given 
state transition, and c is either 0 or 1, according to whether 
the corresponding encoder input is zero or one. The re- 
sulting labelled, directed gTaph is called the “ DIL state 
diagram” for the code. 

In Fig. 2 is the DIL state diagram for a K = 3 ZRL 
code. For example, in Fig. 2 the edge from state 10 to 11 
is labelled D W °IL. This is because the transition 10 — + 11 
is caused by an encoder input of 1, so that the exponent 
of I in the edge label is 1. State 10 has ZRL = 0, and by 
definition of a ZRL code, when the state has ZRL = 0 and 
the input is 1, the output weight is u>o; thus the exponent 
on D in the label is w 0 . The other seven edge labels can 
be explained similarly. 

A path of length m in the DIL state diagram is defined 
as a sequence of m + 1 vertices such that each adjacent 


The three- variable transfer function T(D , J, L) is now 
defined to be the sum of the weights of all paths from vertex 
0*- 1 back to vertex 0 K ~ l which have no intermediate re- 
turns to vertex 0^ -i . Alternatively, T(D y I,L) is the sum 
of the weights of all paths with initial vertex O^ -1 whose 
input string ends with O^" 1 but has no other substring 
equal to O* -1 . (In [3, Section 9.3] these paths are called 
“fundamental paths.”) 

In principle, one can compute T(D, 7, L) for any con- 
volutional code using the standard “transfer matrix 
method” described, for example, in [5, Sec. 4.7]. How- 
ever, this method is essentially equivalent to inverting a 
2 k ~ 1 x 2 K ~ l matrix with three- variable monomial entries, 
and is not in general practical except for codes with ex- 
tremely small constraint lengths [7], However, for a ZRL 
code, one can simplify this calculation considerably, by 
first “collapsing” the state diagram by combining states 
with the same value of ZRL. In the collapsed state di- 
agram, there will be K vertices, labelled 0,1 ,..., K — 1; 
vertex i will be connected by a directed edge to vertex j if 
there is any edge in the original (noncollapsed) DIL state 
diagram connecting a vertex with ZRL = i to one with 
ZRL = j. The label on an edge in the reduced state di- 
agram will be the same as the label on the corresponding 
edge in the original graph; the ZRL property implies that 
this rule is well defined. 

The collapsing process is illustrated in Fig. 3, which 
shows the collapsed version of the graph in Fig. 2. Note, for 
example, that in Fig. 3 the edge from vertex 0 to vertex 1 
is labelled D U °L. This is because in Fig. 2, both edges 
from a vertex with ZRL = 0 to a vertex with ZRL = 1, 
viz. 10 — > 01 and 11 — *■ 01, have the same label D U °L. 

When the DIL state diagram for a constraint length 
I< ZRL code is collapsed, the resulting state diagram will 
be identical to the state diagram in Fig. 1, where the labels 
c*i and Pi are given by 


or,* = D U 'L 
Pi = D Wi IL 
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One can think of the collapsed state diagram of Fig. 1 
as the state diagram of a finite-state machine, with input 
alphabet {0, 1} and output alphabet the set of monomials 
D w I € L. If this machine is in state i and its input is 0, its 
next state is min(i + -f 1), and its output is D U 'L ; 
if it is in state i and its input is 1, its next state is 0 
and its output is D W% IL. Note that, as for the original 
state diagram, any path in the collapsed state diagram 
is specified by its initial vertex and its input string. For 
example, the path 2 — >0— ► 1 — >2 in the collapsed state 
diagram of Fig. 3 has initial vertex 2 and input string 100. 
Its weight is L z I. 

The important point is that the collapsed state dia- 
gram is equivalent to the original state diagram for pur- 
poses of computing the T(D, 7, L) transfer function for the 
ZRL code. This is because a path in the original DIL 
state diagram with an initial vertex with ZRL = i and in- 
put string <7 will have the same weight as a path in the 
collapsed state diagram with initial vertex i and the same 
input string < 7 . For example, the path in the state dia- 
gram of Fig. 2 with initial vertex 00 and input string 100 
has weight D W2 + U ° +Ul L 3 /, which is the same as the weight 
of the path in the collapsed state diagram of Fig. 3 with 
initial state 2 and input string 100. 

It follows then that the T(D,7,L) transfer function 
for a ZRL code is the sum of the weights of all paths in 
the collapsed state diagram of Fig. 1 from state K — 1 
back to state 7\ — 1, with no intermediate returns to state 
K - 1. This transfer function is denoted by T^_ lK _ v 
One way to compute K _ x is to remove the vertices 

1,2,...,7l — 2 from the state diagram, but to preserve 
the path label information by relabelling the remaining 
edges appropriately, as shown in Fig. 4. For example, in 
Fig. 4, the edge from vertex 0 to vertex — 1 is labelled 
■ * • &K-2] this is because in Fig. 1 there is exactly 
one path from vertex 0 to vertex K — 1 that uses only the 
deleted vertices {1,2, . . . , K — 1}, viz. 012 • * • (7\ — 1), and 
its weight is aoa\ • * • ock- 2- Similarly, the loop at vertex 0 
is relabelled to reflect the fact that there are I\ — 1 paths 
from vertex 0 back to vertex 0 which use only the deleted 
vertices: 00, 010, 0120, . . . , 012 • • • (7\ — 2)0, and the sum 
of the weights of these K — 1 paths is (3q -f c*o/?i + * • ■ + 
oro * • * a K-3PK-2> which is the label on the loop at vertex 0 
in Fig. 4. 


path (K — l)0(K — 1) divided by 1 minus the weight of the 
loop at vertex 0, i.e., 

T* — 

1 ~ 


ape *! • ■ - olk-iPk-\ 

1 — A) — — C*0«l/?2 — ■ • ' — o 0 ■ ■ ■ (*K-3Pk-2 


If one substitutes the above values for o, and A into this 
expression, and uses the definition of the profile, the ex- 
pression for T(D,7,L) in the statement of the theorem is 
obtained. 

8. Corollary. For a ZRL convolutional code with 
profile (di, . . . , d#), the free distance is dx and 


T num (D) = 


D dK 

P(D) 


D d K 

T bit( D ) = p( D y 


TlenP) = 


D iK Q{D ) 
P{ D) 2 


where the polynomials P(D) and Q(D ) are defined by 


K-i 

P{D) = \-Y,D d ' 

i - 1 


Q(D) = K - - i)D d ' 

1 = 1 

Proof: This follows directly from Theorem 7 and the 
definitions of T nU m(77), 7bit(7)), and 7] e n(7}) given at the 
beginning of the article. 

9. Example. Continuing Examples 3 and 6, the 
profile is (3, 3,7), and so P(D) = 1-2 D 3 , Q(D) = 3- 3D 3 . 
Thus, by Corollary 8, df ree = 7, and 


Once the state diagram has been reduced to only two 
states, the computation of the transfer function TfC-l.K-l 
is straightforward. Any path from vertex K — 1 back to 
vertex K — 1 with no intermediate return to vertex K — 1 
in Fig. 4 must be of the form (I\ — 1)0 • • ■ 0(I\ — 1), and so 
the desired transfer function is equal to the weight of the 


T num (D) = 


D 7 

l -2D 3 

D 7 + 2D 10 + 4D 13 
+ 8 D 16 + 16D 19 + 32D 22 


+ ... 
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Tbit(D) = (j _ 22)3)2 

= D 7 + 4D 10 + 12D 13 
+ 32D 16 + 80D 19 + 192D 22 + • • • 


and, for K > 3 


0 1 ■■■!<- 3 K - 2 I< - 1 


= ° 
1 


/V - 3 

\2*" 3 


2^—3 ... 2 k ~ 3 

2 k ~ 3 ... 2 k ~ 3 


2 k ~ 2 

0 


0 


2 k-2 


T len (D) = 


D 7 (3 - 3D 3 ) 

(1 - 2D 3 ) 2 

3D 7 + 9D 10 + 24D 13 + 60D 16 
+ 144D 19 + 336D 22 + • ■ ■ 


IV. Superorthogonal and Ultraorthogonal 
Codes 

Next, two important general classes of ZRL convo- 
lutional codes, the superorthogonal codes introduced by 
Viterbi [1] and the ultraorthogonal codes introduced here, 
are defined. 


10. Definition. The superorthogonal code of con- 
straint length K j denoted by Sk , is defined as follows: 
Si = (1), and for K > 2, then Sk is a (2^ -2 ,l) code 
whose generator polynomials are all 2 K ~ 2 possible poly- 
nomials of the form 1 + g\x + f- gx- 2 % K ~ 2 4- x K ~ l . 


Similarly, the weight tables for the ultraorthogonal codes 
are as follows: 


=!(!!) 


0 

W(V*) = J 



1 

0 

0 


0 1 2 


0 

W(U 3 ) = { 



and, for K > 3 


11. Definition. The ultraorthogonal code of con- 
straint length K } denoted by Uk> is defined as follows: 
U i = (0), and for K > 2, then Uk is a (2 K ~ 2 , 1) code 
whose generator polynomials are all 2 K ~ 2 possible poly- 
nomials of the form g\x 4- h gx- 2 ^ K ~ 2 4- x K ~ l . 


0 1 


0 ( 2^“ 3 2 k ~ 3 

W(U k ) = 

1 \^ 2^- 3 2*’ 3 


I< - 3 I<- 2 K- 1 
2 k ~ 3 2 k ~ 2 0 
2*~ 3 2 k - 2 0 


12. Example. For K = 3 the code 53 has generator 
polynomial list (1 -f x 2 , 1 4* x 4- x 2 ), and U 3 has generator 
polynomial list {x 2 ) x 4* x 2 ). 

13. Theorem. For all K > 1, the codes Sk and Uk 
are ZRL codes. The weight tables for the superorthogonal 
codes are as follows: 


Proof: The key to the proof is the close relation- 
ship between the convolutional codes Sk and Uk and the 
first-order Reed-Muller (1RM) block codes, which are now 
described. The (2 m ,;n4- 1) 1RM code can be defined by an 
(m4*l) x 2 m generator matrix G m which has as columns all 
possible binary (m+ l)-tuples ending with 1. For example, 
with m = 2 the (4, 3) 1RM code has generator matrix 




o 

W(Si) = 1 




0 

W(S 3 ) = 1 



1 2 
2 0 

0 2 


( 001 1 \ 
0 10 1 
1111/ 


It is known that all weights in the (2 m ,m + 1) 1RM 
code are equal to 2 m_1 , except for the all-zero word and 
the all-one word ([8], Chapter 13). If G ^ is defined to be 
the matrix obtained by adding a row of zeros at the top 
of G m , and G’ n to be the matrix obtained by adding a 
row of ones at the top of G m , then the columns of 2 
give the coefficients of the generator polynomials of Uk 
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and the columns of G l K _ 2 give the generator polynomials 
of Sk . For example, again with m = 2, 


/o 

0 

0 

°N 


(l 

1 

1 

i \ 

0 

0 

1 

1 

G \ = 

0 

0 

1 

1 

0 

1 

0 

1 


0 

1 

0 

1 

V 

1 

1 

ij 


V 

1 

1 

V 


It therefore follows that every (2 K “ 2 )-bit code segment in 
either of the codes Sk or Uk is a word in the (2 K ~ 2 , A'— 1) 
1RM code. In almost every case, this segment will have 
weight 2^~ 3 ; the only other possibilities are weight 0 (the 
all-zero codeword) and weight 2^~ 2 (the all-one codeword). 
To analyze these exceptional cases, note that every lin- 
ear combination of rows of either G ^ or G ln is a word in 
the 1RM code. All such linear combinations will therefore 
have weight 2 m with the following exceptions. In 
the empty linear combination, or the top row, give the all- 
zero codeword; and the bottom row, or the top row plus 
the bottom row, give all ones. In G^, the empty linear 
combination or the top row plus the bottom row gives the 
all-zero codeword; and the top row or the bottom row gives 
all ones. 

Therefore, in the ultraorthogonal code Ux> the code 
segment will be all zeros if and only if the state is 0^ -1 
and the input is 0, or the state is O*' 1 and the input is 
1. Similarly, the code segment will be all ones if and only 
if the state is O^” 2 ! and the input is zero, or the state is 
0*“ 2 1 and the input is 1. Thus, the output weight will be 
2*“ 2 unless the state has ZRL = K — 1 and the input is 0 
or 1, in which case the output weight is 0, or if the state 
has ZRL = K — 2 and the input is 0 or 1, in which case the 
output weight is 2 K ~~ l . This is what the theorem states 
about the ultraorthogonal codes. 

Similarly, in the superorthogonal code Sk } the code 
segment will be all zeros if and only if the state is 
and the input is 0, or the state is 0^ -2 l and the input is 1. 
Similarly, the code segment will be all ones if and only if 
the state is O^ -1 and the input is 1, or the state is O* -2 ! 
and the input is 0. Thus, the output weight will be 2 K ~ 2 
unless the state has ZRL = K — 1 and the input is 0, or 
if the state has ZRL = K - 2 and the input is 1, in which 
case the output weight is 0; or if the state has ZRL = K — 1 
and the input is 1, or if the state has ZRL = K — 2, and 
the input is 0, in which case the output weight is 2 K ~ l . 
This is what the theorem states about the superorthogonal 
codes. 


Theorem 13 provides many ZRL codes. The following 
definition and the discussion that follows will show how 
to use the superorthogonal and ultraorthogonal codes to 
build many other ZRL codes. 

14. Definition. Given two convolutional codes, their 
sum is defined to be the convolutional code whose genera- 
tor polynomial (g.p.) list is obtained by merging the g.p. 
lists for the original codes. Thus for example, the sum of 
the (3, 1) code with g.p. list (1, 1 + x, 1 + x + x 2 ) and the 
(2, 1) code with g.p. list (1 + x 2 , 1 -j-x + x 2 ) is the (5, 1) code 
with g.p. list (1, 1+x, 1 + x 2 , 1 + x + x 2 , 1 + x + x 2 ). In gen- 
eral, the sum of an (ni, 1) convolutional code of constraint 
length Ki and an (n2, 1) convolutional code of constraint 
length K 2 is an (ni+n2, 1) convolutional code of constraint 
length max(ATi , K 2 )> 

15. Lemma. If C\ and G2 are ZRL convolutional 
codes, with constraint lengths I\\ and K 2 , respectively, 
with Afi < K 2y then C\ + G2 is also ZRL, and the weight 
table for C\ + C 2 is obtained from the weight tables W\ 
and W 2 by first extending W\ by repeating its last column 
I< 2 — K\ times, and then adding the two weight tables 
together. 

Proof: If the two codes have the same constraint 
length, this is immediate. If, however, the two constraint 
lengths are different, and K\ < K 2 , C\ can nevertheless 
be regarded as a convolutional code with constraint length 
K 2 in which the last K 2 — K\ bits in the shift register are 
never used. States with ZRL values K\ t K\ + 1, . . . , K 2 — 1, 
will plainly behave just like the all- zeros state (with ZRL = 
K\ — 1), and the extra I( 2 — K\ columns that appear in 
the weight matrix will be identical to the last column of 
the unextended weight matrix. The result now follows. 

16. Example. The code of Example 3 is S\ + 1/2 + S3, 
as may easily be verified. The corresponding weight tables 
are, by Theorem 13, 

<ns.) = (J) 

HW=(i 2) 
w(s S )=(; 2 2) 

To obtain the weight matrix for S\ + £^2 + S3, first extend 
W^Si) and W(U 2 ) to dimensions 2 x 3 by repeating the 
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respective last rows, and then adding the resulting matri- 
ces: 

*-(; ; iH\ 

(2 2 0 \ 

\3 1 3j 

which is the same as was seen in Example 3. 


19. Example. By combining Theorems 7 and 18, one 
can obtain the transfer function for the superorthogonal 
codes. Indeed, if z — D 2 "~* , it follows from these theorems 
that for the superorthogonal code of constraint length K , 

T(D,I,L ) 

z k+2 1L k 

= 1 - zIL( 1 + zL + • • • + - ^- 2 IL K-l 



17 . Example. For any I\, the code (Sj + Ui) 
is by Lemma 15 a ZRL code. In fact, this code has as 
generator polynomials all 2 ^ polynomials of degree < A — 
1; it is the orthogonal code of constraint length I\. 

18 . Theorem. The profiles of the codes Sk are: 


profile(Sj) = (1) 
profile(S 2 ) = (0, 2) 
profile(S 3 ) = (1,1,5) 
profile(5 4 ) = (2,4,4, 12) 
profile^) = (4, 8, 12, 12, 28) 

profile(5if ) = ( 2 k ~ 3 , 2 • 2 K ~ 3 , . . . , 

(K — 2)2 K ~ 3 , (I< - 2)2 k_3 , 
(. K + 2)2 K ~ 3 ) 


The profiles of the codes Uk are 


z k+ 2 IL k ( 1 - zL) 

= l-z(L + IL) - z K ~ 2 IL K ~ 1 + z K ~ l {IL K - 1 + IL K ) 

an expression first found by Viterbi [1]. It follows then 
from Corollary 8 that df ree = (K + 2)2^ 3 and 


Tnum{D) = 


z K + 2 (1 - z) 
l-2z - z K ~ 2 + 2z K ~ l 


z K+2 ( 1 - z) 

~ {\-2z)(\-z K ~ 2 ) 

r 2 k ~ 3 

- Z K + 2 \ - 

\ (2 k ~ 2 — 1)(1 — 2z) 

( 2 k-3 _ 1) _ z _ 2z 2 2 k ~ a z k ~ 3 \ 

+ ( 2 K-2_l)(l_ 2 K--i) j 

In the last expression, a two-term partial-fraction decom- 
position is seen (in braces) for the generating function 
T num (D)/z K+2 . The coefficient of z k in the expansion of 
the first term is 


2 k ~ 3 
2 *“ 2 - 1 


• 2 k 


profile(f7j) = (0) 
profil e{U 2 ) = ( 1 , 1 ) 
profile(i7 3 ) = (1,3,3) 

profile(f/ 4 ) = (2, 4, 8, 8) 

profile^) = (4,8,12,20,20) 
profile^*) = (2 k ~ 3 ,2- 2 k ~ 3 

(I< - 2)2 k ~ 3 , K2 k ~ 3 , K2 k ~ 3 ) 

Proof: This follows by combining Theorem 13 and 
Lemma 5. 


The coefficients of the expansion of the second term are 
periodic of period K — 2, and each term is less than 1/2 
in absolute value. Since it is known that the coefficient of 
z k in the combined expansion is an integer, it follows that 
this coefficient must be the integer closest to 


Therefore, it has been proved that the coefficient 
of D dt ree +*2 K 3 J n Tnum(^) for the superorthogonal code 
of constraint length K is 

2^-3 

N du „+k = integer closest to 2K ^~ ' 2 * 
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As a special case, it is found that the (8, 1), K ~ 5 su- 
perorthogonal code has df ree = 28, and the number of fun- 
damental paths of weight 28 + 4k is the integer closest to 
| • 2*, i.e., 

T num (D) = D 28 + D 32 + 2D 32 + 5D 36 

+ 9 D 40 + 1 8U 44 + 37 D 48 + 0(D S2 ) 


V. A Representation Theorem 

If Theorem 13 is combined with Lemma 15, many ZRL 
codes can be constructed. It is surprising (and perhaps 
disappointing) that all such codes are constructed this way, 

20. Theorem. An (n, 1) convolutional code C of 
constraint length K is ZRL if and only if it is the sum of 
copies of superorthogonal and ultraorthogonal codes: 

K 

c = 5 + TliUi) 
i - 1 

where m* and n t - are integers denoting the multiplicities of 
Si and Ui in the code C . 

Proof: The proof of this theorem is lengthy and will 
be omitted. 

The next lemma, when combined with Theorems 20 
and 18, enables one to write down the transfer functions 
for any ZRL convolutional code. 

21. Lemma. If C\ and C 2 are ZRL convolutional 
codes, with constraint lengths K\ and K 2 respectively, 
with I<i < K 2i then the profile for the sum C\ + C 2 is 
obtained from the profiles for Ci and C 2 by first extending 
profile^i) to length K 2 by repeating its last entry K 2 —I\i 
times, and then adding the two profiles together. 

Proof: This follows by combining Lemma 15 with 
Lemma 5. 

22. Example. The ZRL code in Example 3 is C = 
Si+Vt+S*, as was seen in Example 16. The corresponding 
profiles are, by Theorem 19, 

profile(Si) = (1) 

profile(t/ 2 ) = (1, 1) 

profile(S 3 ) = (1,1,5) 


To obtain C’s profile, use Lemma 21. First extend the 
profiles of Si and U 2 to length 3 by repeating the last 
entries, and then add the resulting lists: 

profile(C) = (1, 1, 1) + (1, 1, 1) + (1, 1, 5) = (3, 3, 7) 

as was seen in Example 6. However, for the same values 
of n and K> one can get a larger df ree by considering the 
code 2 S 3 instead, since its profile is 2(1, 1,5) = (2,2,10), 
so that dfree = 10. And in fact, for n = 4 and K = 3 
this is the largest possible free distance, since the Plotkin 
bound for these parameters gives df ree < 10. In general, 
for (n, 1), K = 3 ZRL codes, the largest possible df ree 
LtJ’ thieved by + (n mod 2)S 2 , whereas the 

best possible df ree among all codes, ZRL or not, is [-J, 
achieved by + * 2 ) + + x + * 2 )- The 

ratio of these two values approaches 16/15 as n — ► oo, 
and the smallest value of n for which these two values 
differ by as much as two is n — 9, where the best ZRL 
code 4S 3 + S 2 has df ree = 22, but the code with g.p. list 
(3(1 + x 2 ), 6(1 + x + x 2 )) has df ree = 24. However, even in 
this case the ZRL code may be competitive, since its T num 
is 

D 22 

— ~ D ' - = ° 22 + 1)26 + 1)28 + H 30 + °( D32 ) 

whereas the unrestricted code has 

t — = = 2D " + 5030 + °( d3 “) 

And indeed, an asymptotic analysis shows the rate 
of growth of the coefficients of T nnm (D ) for the ZRL code 
to be « (1.1577) n , whereas for the unrestricted code it 
is ft: (1.1740)”. Thus, as discussed in [2], the ZRL code 
may perform better at low signal-to-noise ratios than the 
non-ZRL code. 


VI. Summary 

A class of convolutional codes, termed zero-run length 
(ZRL) convolutional codes, has been discovered for which 
the free distance can be computed by inspection, and for 
which there is a closed-form expression for the three- vari- 
able transfer function. This class of codes includes the su- 
perorthogonal codes introduced by Viterbi [1] and analo- 
gous “ultraorthogonal” codes introduced here. It has been 
found that, while ZRL codes are much more general than 
superorthogonal or ultraorthogonal codes, any ZRL code 
may be constructed as a combination (“sum”) of super- 
orthogonal and ultraorthogonal codes. 
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Although ZRL codes have very low rates for large con- 
straint lengths, they are nevertheless interesting and po- 
tentially useful. Furthermore, many of the ideas developed 


here to analyze this class of specialized codes, such as the 
use of reduced state diagrams, might extend to other in- 
teresting code classes as well. 
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A Viterbi decoder’s performance loss due to quantizing data from the additive 
white Gaussian noise (AWGN) channel is studied. An optimal quantization scheme 
and branch metric calculation method are presented . The uniformly quantized 
channel capacity C u (q) is used to determine the smallest number of quantization 
bits q that does not cause a significant loss . The quantizer stepsize which maximizes 
C u (q) a/most minimizes the decoder bit error rate (BER). However, a slightly larger 
stepsize is better, like the value that minimizes the Bhattacharyya bound. The range 
and renormalization of state metrics are analyzed, in particular for K = 15 decoders 
such as the Big Viterbi Decoder (BVD) for the Galileo mission. These results are 
required to design reduced hard ware complexity Viterbi decoders with a negligible 
quantization loss. 


I. Introduction 

Theoretically, Viterbi decoding is a maximum- 
likelihood decoding algorithm for convolutional codes. In 
practice, the main performance loss results from quantiz- 
ing input data with q bits. The decoder’s hardware com- 
plexity and speed depend strongly upon q and the state 
metric register length t. Therefore, these parameters must 
be chosen as the smallest values that do not cause a signif- 
icant bit signal-to-noise ratio (Et/No) loss. A constraint 
length K = 15 decoder performs double the computation 
of a K = 14 decoder, but requires about 0.1 dB less Eb/N 0 
for a bit error rate (BER) of 0.005. Since part of the de- 
coder’s hardware complexity increases only linearly with q , 
even a 0.01-dB quantization loss is large. However, given 


that one must construct a fully parallel K = 15 (or K = 7) 
decoder, a slightly larger loss might be acceptable or re- 
quired by hardware and speed constraints. 

The uniformly quantized, additive white Gaussian 
noise (AWGN) channel capacity C u (q ) is used to estimate 
, the quantizer stepsize A and smallest q that result in a 
negligible loss. For each q , almost minimum BER occurs 
when A maximizes C'u(g) or minimizes the Bhattacharyya 
bound y. New methods are presented to minimize the 
state metric register length £ in bits. These estimates 
are verified by simulations of three codes : the constraint 
length K = 7, rate R = 1/2, NASA standard code; the new 
experimental K = 15, R — 1/4, Galileo code [1]; and the 
I< = 15, R = 1/6, “2-dB” code [2]. 
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These results are used to determine the best design 
parameters g, £, and A for K = 15, rate 1/4 and rate 1/6 
decoders. Using 6 quantization bits with 10-bit state met- 
ric registers would substantially reduce the Big Viterbi De- 
coder (BVD) [4] hardware complexity and allow the sys- 
tem clock frequency to decrease by a factor of 0.56 as com- 
pared to the current design, which has q — 8 and t— 16. 
Using q = 5 would cause an E b /No loss of 0.02 dB at a BER 
of 0.005 for the Galileo code or “2-dB” code, but there is 
no measurable E b /No loss when q = 6 . Since the same 
losses occurred for 8 -bit symbol error rates (SER), these 
results apply when an outer block code is concatenated 
with the convolutional code. 


II. Branch Metrics 

When an encoded 0 or 1 is mapped to -hi or — 1, 
respectively, and then transmitted, the receiver’s demod- 
ulator output is a conditionally Gaussian random vari- 
able y with mean -fm or — m and the same variance 
a 2 = m 2 /(2REb/No) as the zero-mean AWGN channel 
noise. (This holds for binary phase shift-keyed [BPSK] 
signaling with ideal coherent detection.) 

For the AWGN channel, a Viterbi decoder finds the 
trellis path with minimum Euclidean distance (or equiv- 
alently, minimum negative inner product) to the received 
sequence. Thus, each trellis branch metric is the inner 
product of the length n branch label (with 0 and 1 re- 
placed with -f 1 and — 1 ) and the negative of a received 
vector [t/i , J/ 2 > • • • , Vn\- Hence, the decoder adds —y, or 
+ Vi (equivalently (-y t - + |y ,|)/2 or (y* + |y ,|)/2 when a is 
fixed, because incrementing or multiplying all branch met- 
rics by a constant does not change the decoder’s output) 
to the metrics of those branches with a -f 1 or —1 in posi- 
tion *. Therefore, the decoder may add |y,| to the metrics 
of branches having different signs in position i than that 
of y t , and zero otherwise. This sign-magnitude method is 
used throughout this article because it halves the branch 
and state metric maximum ranges, as compared to using 
standard integer metrics [3,4]. For example, using this 
method in the Scarce-State I\ = 7, rate 1/2 decoder [5] 
would substantially decrease the chip circuitry. 


III. Quantization 

When zeros and ones are equally likely in the encoder 
input data, 

Pr(|j/| = *) = 1 [Pr(y = x | +1) + Pr(j/ - -x | +1) 


+ i[Pr(y = x|-l) + Pr(y=-x|-l)] 
= Pr(y = x | +1) + Pr(y = x | -1) 


y/2iT<j L 


,-(ar-m) 3 / 2<r 3 m) 3 /2<7 2 


In this article, m = 0.84 volts. The probability distribu- 
tion function of |y| (Fig. 1) suggests that more quantiza- 
tion levels are required for the K = 15 codes operating 
near 0 dB (high noise variance) than the NASA code at 
E b /NQ = 2.25 dB. 


Let the random variable J be the quantized value of 
y and for — 2 ?_ 1 — 2 < j < 2^ -1 — 2 define 


Pi = p t(J = j i +i) 


(j+0.5)A 


x/2tt< 


7TCT 


/ 


^-(y-m) 3 }2a‘ 


dy 


U~ 0.5)A 


For j — ±(2 q ~ l — 1), pj is the above integral with 
limits (j — 0.5)A and -foo, or — oo and ( j + 0.5)A. 

Since |Ji|, . . . , \ J n \ are summed to form branch met- 
rics, the absolute error | J,- — y t | in quantizing y t is also the 
contribution to the branch metric error incurred. A de- 
coder using signed integers to represent J, could conceptu- 
ally use 0, ±A, ±2A, . . . ,±(2*”* - 1)A for any real num- 
ber A, because multiplying all metrics by A has no effect. 
Therefore, the quantizer thresholds should be uniformly 
spaced A volts apart at ±A/2, ±3A/2, . . . , ±(2 7 -l )A/2, 
because this minimizes the metric error defined above (and 
also any positive function of — y j). Thus, only uniform 

quantization schemes, characterized by q and A, are con- 
sidered herein. (Several simulations of the NASA code 
using 3 -bit integer branch metrics and nonuniform quan- 
tization schemes never produced lower BERs than using 
the best A). 

For = 3, Ji is normally one of 7 values from —3 to 
+3, so quantizer levels +4 and —4 are appended (Fig. 2) to 
decrease the BER near that for 8 levels and standard inte- 
ger metrics. Thus, the maximum magnitude of J, 2 ?-1 — 1 , 
will be replaced by 4 instead of 3 for q — 3 throughout this 
article. In rate 1/2 decoders, a branch metric of 8 is de- 
creased to 7 so that q — 3 bits still represent all possible 
values. Since Pr(|J,*| = j) = Pj this event occurs 

with probability (p +4 -f P- 4 ) 2 , which is only 0.11 for the 
NASA code at E b /N 0 = 2.25 dB. 
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IV. Quantizer Stepsize 

Ideally, the uniform quantizer stepsize A should min- 
imize BER and SER over the decoder’s operating range of 
channel noise levels. In practice, a A which almost min- 
imizes the BER for the lowest expected E b /N o will also 
nearly minimize both the BER and SER when E b /No in- 
creases by up to 1 dB. Simulations (described later) indi- 
cate that the A that maximizes channel capacity is near 
optimum. 


The A that minimizes 7 is the safest choice because it 
is slightly larger than the stepsize which minimizes BER. 
Also, minimizing 7 yields the lowest BER for q = 3 with 
9 quantizer levels (Fig. 4). Finally, the corresponding 8- 
bit SER curves are not shown because they have the same 
relative shape and spacing as the BER curves in Fig. 4. 
Many sets of software simulations were run for the NASA 
code and the Galileo code. The values of q were 3, 4, 5, or 
6 and E b /No ranged from 0 dB to 3.5 dB. 


Since the binary-input quantized AWGN channel is 
symmetric, capacity is achieved with equiprobable inputs: 


2 < *~ 1 -1 

C u(q) = ^2 p{j | +1) log 2 

j'=-2»- I + l 


Mj I +1) 

p(J I +1) + p(j I -1). 


= 1- 



j=- 2«- , +l 


Pj log 2 




bits per channel use. Figure 3 shows how rapidly the max- 
imum possible uniformly quantized AWGN channel capac- 
ity C u (q) approaches its limit for several noise variances; 
C u (3) is based upon the 9-level quantizer in Fig. 2 instead 
of using 8 or 7 levels. Ag = 4org = 5 quantizer has 
15 or 31 levels, respectively. The data points in Fig. 2 
indicate C u (q) for integer values of q. The lines between 
data points are channel capacities when uniform quantiz- 
ers have intermediate numbers of levels, such as 24. 

The curves in Fig. 3 show that there is negligible 
capacity gain for q > 6, and in fact (7 u (5)/C u (oo) > 0.9975 
suggests that there will be a very small loss for q = 5. 
Figure 4 shows how the performance of the NASA code 
at Eb/No = 2.25 dB varies with q and A. Observe that 
the minimum BER for q — 5, 4, and 3 increases roughly in 
proportion with the decrease in capacity. Also, for <7 = 5, 
there is a negligible loss and the BER increases extremely 
slowly for A greater than the optimum. Therefore, for q > 
4, it is important to choose A larger instead of smaller than 
the best value. The labels C and 7 in Fig. 4 indicate the 
stepsizes that respectively maximize C u (<?) and minimize 
the Battacharyya bound parameter 


t = VpIp^J 

j=- 2*- , + l 

which is a measure of the channel noise level: near 0 for 
high Eb/No and approaching 1 for very noisy channels. 


In all simulations, the As which maximize C u (q) or 
minimize 7 were, respectively, slightly smaller or larger 
than the A that minimized BER. For q = 3 or 4, the As 
which minimize the quantizer mean-square error or abso- 
lute error were too large. 

The simulations in Fig. 5 for the K = 15 codes show 
that using q — 5 or 4 costs 0.02 dB or 0.05 dB at the 
BER of 0.005 required for images. These E b /N 0 quantiza- 
tion losses are the same when the Viterbi decoder output 
becomes the input to an outer block decoder, because the 
8-bit symbol error rate curves are spaced the same distance 
apart as the BER curves. In all simulations, the uniform 
spacing A was chosen to minimize 7. 


V. State Metric Renormalization 

For each received n-vector and encoder state, a 
Viterbi decoder finds the trellis path with least total 
branch metrics into the state. Since the state metrics are 
stored in £-bit registers, occasionally they must all be de- 
creased to avoid overflow. This renormalization can be ac- 
complished by zeroing every register’s most significant bit 
(msb), which is equivalent to subtracting 2* -1 from every 
metric if all registers have msb = 1. However, detecting 
when all 2 K ~ l metrics simultaneously have msb = 1 is 
impractical for a K = 15 decoder such as the BVD. 

At each trellis level, let the random variable M be 
the difference between the maximum and minimum state 
metrics. If any state metric is > 2*~* + 2^“ 2 , (its two 
most significant bits are 1) and M < 2*“ 2 , then all met- 
rics are > 2*” 1 , so every msb = 1. In the BVD, £ — 16 
was chosen to guarantee that 2*“ 2 > Af , and so a sin- 
gle state metric is monitored and renormalization occurs 
when the two most significant bits are 1. The following 
improved method should be used when £ is reduced so 
that M > 2*“ 2 . Let W be the maximum of the metrics of 
the all-zeros state, the all-ones state, and the state with a 
one input followed by K — 2 zeros. Since most state met- 
rics differ from one of these three metrics by only a few 
|J,| contributions, W is close to the largest state metric 
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(Galileo code simulations verified this). Therefore, renor- 
malization could occur when W exceeds a threshold such 
as 2*” 1 + 2*” 2 + 2*” 3 . If more metrics are monitored, the 
threshold can be set closer to 2* - 1 because W will be 
closer to the largest state metric. 

Definition. Let D be the maximum, over all nonzero 
states s, of the least-weight trellis path from the all-zeros 
state into state s . 

Lemma. M < D( 2 q ~ l — 1) 

Proof. Let b and w be the states with lowest and 
highest metrics. Since a convolutional code is linear, there 
exist two trellis paths from some state c, one into state b 
and the other into state w> whose branch labels differ in 
D or fewer positions. Since the maximum contribution to 
a branch metric by one J{ is 2 q ~ l — 1, the state metric of 
w is at most the state metric of b plus D(2 q ~ l — 1). 

Corollary. In the absence of noise, 

M = M o = D • j m 

where j m — [0.5 + m/Aj is the quantizer output when 
-f m volts is input. 


where the subscript i denotes the coefficient of x x in the 
polynomial within the braces. 

Proof. Let 6 and w be the states with lowest and 
highest metric. An upper bound on Pr(A/ = <) is obtained 
by considering the worst possible case: the survivor path 
for state w differs from that for state 6 in exactly D po- 
sitions, and in these positions, the survivor path branch 
labels of w have a different sign than the received J*. Then 
Pr(M = t) is the coefficient of in [m(x)] - 

To achieve a particular (very low) probability, t must 
be unrealistically large since the above bound is not very 
tight. This is fine, because t could be chosen as the least 
power of 2 such that Pr(M > t) < 10” 5 . Then setting 
£ = 1 + log 2 f results in no loss of performance. 

The current BVD design has q = 8 and £ = 16 to ac- 
commodate M < n(I\ — 1)(2 9 ” 1 — 1) and two extra bits for 
renormalization. This results in full maximum-likelihood 
decoder performance. However, using <7 = 6, A = 0.14, and 
^=10 for the BVD operating at 0 to 1 dB Eb/No would 
not increase the BER or SER detectably, but would reduce 
the decoder hardware. Furthermore, the system clock fre- 
quency and thus timing constraints would be reduced by 
a factor of 10/18, 


For nonsystematic codes, D is near df ree and usually 
much less than n(I< - 1), the maximum possible. The 
NASA code has df ree = 10, D = 8, and n(K — 1) = 12. 
Since D = 33 for the Galileo code, Mq = 132 for q = 5, 
A = 0.20, and m = 0.84. Since D — 50 for the rate 1/6 
“2-dB” code, Mq — 200. Simulations for the Galileo and 
NASA codes show that Mq is an upper bound on the mean 
of M when the channel is noisy and 2 Mq is always greater 
than M . 

As in the q = 3 case where levels 4-4 and -4 were 
adjoined, a rule for limiting branch metrics may be derived 
by computing their probabilities. Define 


2 9 ** 1 — 1 

™(z) = Po + ( p i + 

3 = 1 

Then Pr(|Ji| = j ) = {m(a;)} ., the coefficient of in 
m(x). Since the largest possible branch metric is the sum 
of n independent values | J, |, . . . , | J„[, it equals t with prob- 
ability {[ro(^)] n }r Thus M could be reduced by limiting 
branch metrics. 


Claim. Pr (M>t)< ^ | [m(x)] j 

i=t * 


When £ < 1 + log 2 [D(2 9_1 - 1)], then occasionally a 
state metric may overflow, whereupon it is immediately 
decreased by 2*, instead of 2*” 1 at the next renormaliza- 
tion. Protecting against overflow is important because a 
state with a high metric might suddenly become one of the 
best states, causing the decoder to make wrong decisions. 
This can be avoided by setting state metrics that overflow 
equal to all ones (2 l — 1). Then states with very high met- 
rics remain this way even after renormalization so they do 
not affect the decoder’s output. An underflow is the event 
that occurs at renormalization when a state metric has 
msb = 0, in which case the metric is effectively increased 
by 2*” 1 . Rarely, underflows may occur because it is infea- 
sible to continuously check all 2 K ~ l state metrics to find 
the least value. In conclusion, overflows can be prevented 
by extra hardware, but underflows will occasionally hap- 
pen. In practice, always examining several state metrics 
gives a good approximation of the current metric size and 
range M. Hence, renormalization can take place so that 
overflows and underflows occur with very low probability. 

Myth. When state metrics overflow or underflow, 
the decoder fails completely. 

One million decoded bit simulations for q = 5 and 4 
with short state metric registers having £ = 9 and 8 bits, 


118 



respectively, yielded the same results as in Fig. 5, because 
the odd underflow or overflow that occurred did not signif- 
icantly affect the output. This follows from the Viterbi de- 
coder’s robustness and tolerance of occasional state metric 
disruptions. Further shortening of the state metric regis- 


ters to 8 and 7 bits resulted in a graceful BER increase, 
as though q was being decreased. This behavior is ex- 
pected because the overall trellis path metric resolution 
is the decoder parameter, affected by input quantization, 
which influences decisions. 
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Fig. 1. Received signal magnitude distribution. 



Fig. 3. Uniformly quantized AWGN channel capacities. 



Fig. 2. Optimal quantization for 3>b!t branch metrics. 
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The Big Viterbi Decoder (BVD), capable of decoding convolution a/ codes with 
constraint lengths of up to 15, is under development for the DSN. As part of the 
development, a commercial single-chip (7,1/2) Viterbi decoder is used to enable 
early start of system integration. Tests of the integrated partial system (including 
simulator , input interfaces , output interfaces , and computer controls) were recently 
completed at the DSN Compatibility Test Area (CTA-21) at JPL . This article 
describes the system elements used for the demonstration and test results. 


I. Introduction 

The Big Viterbi Decoder (BVD) is under develop- 
ment for the Deep Space Network (DSN) [1]. It is in- 
tended to provide up to 1.8 dB improvement in link margin 
through the use of convolutional codes with larger con- 
straint lengths. Specifically, the BVD is designed to op- 
erate with any convolutional code with constraint length 
of K < 15 and code rates 1/2, 1/3, . . . , 1/6. In contrast, 
the current equipment is designed for the standard DSN 
code, K — 7 and rate 1/2. The BVD prototype will be 
used in a May 1991 demonstration in conjunction with the 
Galileo mission. Following a successful demonstration, the 
decoder will be inserted into the DSN for use with Galileo 
and future missions. 

A block diagram of the BVD, as initially planned, 
is shown in Fig. 1(a). The core of the decoding pro- 


cessing is performed in the Processor Assembly using 256 
or 512 identical custom VLSI chips. All the other func- 
tions are performed in the Controller Assembly. These 
include transforming of input soft symbols, buffering data 
to the Processor Assembly, interfacing to output devices, 
and providing full self-test capability. Early on it was 
recognized that the Processor Assembly, especially the 
VLSI chips, is the time-critical element in the devel- 
opment schedule since no meaningful DSN-compatibility 
tests could be conducted without having a “decode” capa- 
bility, which requires a full Processor Assembly. To over- 
come this bottleneck, a secondary “decode” path was in- 
troduced as shown in Fig. 1(b). The additional path uses 
a commercial QUALCOMM Q1401 (7,1/2) decoder chip 
and enables testing of many BVD functions well before 
the Processor Assembly is ready. A partial BVD, shown 
in Fig. 1(c), was completed and tested in the laboratory 
and in the DSN Compatibility Test Area (CTA-21) at JPL, 
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verifying decoder operation for (7,1/2) code and current 
DSN interfaces. 


II. Functional Block Description 

The system under test consists of a MULTIBUS I 
chassis with six boards: an Intel 80386/21 CPU and five 
custom digital boards. Figure 2 shows a more detailed 
functional block diagram of the five custom boards. These 
boards include functions required in DSN operation, as 
well as functions needed during development and testing. 
Some of the latter are especially critical during diagnos- 
tics and fault isolation, enabling failures to be traced to 
a specific board. The following is a description of these 
functions, by board. 

A. Memory Board 

This board includes 1 Mbyte of Electrically Erasable 
Programmable Read-Only Memory (EEPROM) and is 
used for object code and key parameter storage. It en- 
ables the BVD to accept program updates without a ma- 
jor interruption: the new code is downloaded from an IBM 
PC/AT computer into RAM and stored in the EEPROM. 
Upon reset, the program is read from the EEPROM into 
RAM and executed. This approach eliminates the opera- 
tional problems associated with removing boards and re- 
placing EPROMs (which must be erased under an ultravi- 
olet light). The board also stores critical mission parame- 
ters to allow easy restart after a power glitch. If additional 
memory is required, multiple boards can be installed. 

B. Encoder Board 

The encoder (simulator) board provides a flexible 
source for encoded data test sequences for BVD self-test. 
In operation, a fairly comprehensive self- test with several 
million bits can be run at 1.1 Mbit/sec, requiring only a 
few seconds. The board includes an uncoded data source, 
encoder, and circuitry that gets calibrated noise samples 
from an external noise source. All functions are fully pro- 
grammable. Simulated symbols are generated by passing 
bits from a programmable sequence generator through an 
encoder, and summing them with properly scaled noise. 
The encoder is implemented as a computer-loaded RAM, 
where each state of the encoder shift-register corresponds 
to an address of the RAM. This allows implementing of 
any convolutional encoder for K < 15 and 1/n, n < 6 
through a single computer loading of the RAM. 

Other functions that the board performs are moni- 
toring the mean and variance of simulated noise samples 
(received from an external source), generation of simulated 


bit and symbol clocks, and transfer of simulated bits to the 
comparator board. 

C. SSA/BBA Interface Board 

This board includes circuitry that generates system 
clocks, interfaces to the Symbol Synchronizer Assembly/ 
Baseband Assembly (SSA/BBA) for input symbols, re- 
clocks input signals, and interfaces to the Time Code 
Translator (TCT) for time tagging. It also provides a sim- 
ulated symbol RAM that allows testing of the SSA/BBA 
interface, 

D. SNR Estimator Board 

The circuitry on this board collects data for sym- 
bol signal- to- noise ratio (SNR) estimation. The approach 
is similar to that used in the Symbol Stream Combiner 
(SSC) 1 and requires computation of sum-of-squares and 
sum-of- absolute- values of the received symbols. Comput- 
ing symbol SNR internal to the BVD provides an impor- 
tant diagnostic tool as well as a monitor of BVD health 
during real-time operation. 

The board includes a scale RAM and an alternate 
symbol sign-flipper to allow for computer-controlled ad- 
justment of these symbol attributes. The scale RAM al- 
lows one to scale the input symbols by a constant, which 
also enables testing for possible inversion in the SSA/BBA 
(the Galileo (15,1/4) code is nontransparent). The alter- 
nate symbol sign-flipper is required to compensate for the 
sign flipping used in encoders on board most JPL space- 
craft. Another feature on this board is a coded data test 
RAM that allows the CPU to read and store symbols from 
either the SSA/BBA (in real time) or the simulated signal 
generated by the encoder board. 

The board also includes provisions for future interfaces 
to the DSN through a First-In, First-Out (FIFO) buffer. 
This interface will be utilized when the new Telemetry Pro- 
cessor Assembly (TPA), currently under development, is 
sufficiently defined. 

E. Comparator Board 

The comparator board includes a (7,1/2) Viterbi de- 
coder, a comparator that allows bit error rate (BER) data 


1 S. Dolinar, “A Lot of Things You Always Wanted to Know 
About Signal- to- Noise Estimation Methods (but Didn’t 
Bother to Ask),” JPL Interoffice Memorandum 331-85.2-109 
(internal document), Jet Propulsion Laboratory, Pasadena, 
California, January 22, 1986. 
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collection, symbol rate estimation circuitry, and various 
output interfaces. 

The (7,1/2) Viterbi decoder section is centered on a 
QUALCOMM Q1401 commercial chip, and includes ad- 
ditional circuitry to provide for computer-controlled node 
synch. The Q1401 is designed for the Goddard (7,1/2) 
code, which uses the same polynomials as the DSN code 
but in a reversed order. A circuit that allows the BVD to 
operate with either polynomial order had been designed 
but had not been installed during the tests. It will be 
tested separately later. 

The comparator function compares the decoded bit 
stream to a delayed version of the simulated unencoded 
bit stream. It includes a variable- del ay buffer (used to 
align the two streams) and a set of counters that allows 
for collection of data for BER versus E^/No evaluation. 

The output interface circuit receives decoded bits and 
status data from the (7,1/2) decoder chip or from the Pro- 
cessor Assembly, records these in a test RAM, and channels 
the data to external devices through a Frame Synchroniza- 
tion Subsystem (FSS) driver or a FIFO. The FIFO will be 
used for interface to the new TPA. An optional differential 
decoder is also part of the output circuit. 

F. Other Boards 

To complete the BVD, several more boards will be 
designed and manufactured. The Controller Assembly will 
have two more boards: the Processor Assembly Interface 
board and the Node Synch board. The Processor Assembly 
will have seventeen boards mounted in a complex custom 
backplane. The backplane is being designed and manu- 
factured by Teradyne Connector Systems to JPL speci- 


fications, while the seventeen boards are being designed 
by JPL. Sixteen of these boards will be identical and will 
house custom VLSI chips, while the seventeenth will per- 
form traceback and interface functions. 

III. Laboratory and CTA-21 Test 
Results 

Tests were conducted in the laboratory and in CTA- 
21 to validate BVD operation and compatibility with the 
DSN. It is important to note that as DSN interfaces are 
upgraded, similar tests will need to be conducted with the 
new interfaces. However, the interfaces validated here are 
the minimal set needed for the planned 1991 demonstra- 
tion in DSS-14, namely symbol input from SSA/BBA and 
command/control through an RS232 to a terminal. 

During the CTA-21 test, the BVD was connected as 
part of a telemetry string (Fig. 3). The test was conducted 
at high SNR and the BVD was connected to the station’s 
SSA, replacing the current maximum-likelihood convolu- 
tional decoder (MCD). Successful decoding was demon- 
strated for several input sequence formats. The test used 
the Goddard code, i.e., reverse order for code polynomials, 
and will be repeated when the circuit is modified to han- 
dle both the DSN and Goddard codes. However, the key 
objective of verifying DSN compatibility was achieved. 

IV. Conclusions 

A partial BVD was constructed that has the capabil- 
ity to decode (7,1/2) convolutional codes. It was demon- 
strated in the laboratory and in the CTA-21 environment, 
validating DSN compatibility for the planned May 1991 
demonstration. 
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COMPUTER/MONITOR/CONTROL 



Fig. 1. Functional block diagram of Big Viterbi Decoder: (a) original, (b) modified to 
Include (7,1/2) decoder chip, (c) during test. 
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Fig. 2. Detailed block diagram of Big Viterbi Decoder. 
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Fig. 3. CTA-21 test setup. 
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This article considers a Reed-Solomon (RS) code to be a special case of a re- 
dundant residue polynomial (RRP) code, and presents a fast transform decoding 
algorithm to correct both errors and erasures. This decoding scheme is an im- 
provement of the decoding algorithm for the RRP code suggested by Shiozaki and 
Nishida [1 ], and can be realized readily on VLSI chips . 


I. Introduction 

Classes of redundant residue polynomial (RRP) codes 
were introduced first in [3,4]. These codes are constructed 
by use of the Chinese remainder theorem for polynomials 
over a finite field GF(q). The codeword symbols of the 
RRP codes are expressed as polynomials over this field. 
The RRP codes can correct t error symbols with the aid 
of 2 1 redundant symbols. 

Reed-Solomon (RS) codes constitute a subclass of 
RRP codes and are used in many sectors of today’s indus- 
try. Some examples are the (255,223) 16-error-correcting 
RS code (NASA code) used in deep-space communications, 
the (31,15) 8-error-correcting RS code (JTIDS code) used 


in military communications, and the Cross Interleaving RS 
code (CIRC code) used in the compact-disc industry. 

As Shiozaki [5] points out, by using the Chinese re- 
mainder theorem together with the Euclidean algorithm, 
an RRP code can be decoded without solving the error- 
locator polynomial and the error-evaluator polynomial. 
The decoder developed in [5] is a general frequency- domain 
implementation type depicted in the second block diagram 
in Fig. 9.2 of [2]. The advantage of the decoder in [5] over 
the decoder in [2] is that both the recursive extension and 
the inverse transform can be replaced by a single polyno- 
mial division. However, the method proposed by Shiozaki 
has the disadvantage that the reconstruction of the cor- 
rupted information polynomial F'(x ) from the received 
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symbols involves n polynomial multiplications in GF(q) i 
followed by the operation modulo M(x), where n is the 
codeword length and M{x) is a product of n polynomials. 
These operations severely lower the decoding speed. 

This article considers RS codes to be a special case 
of the RRP codes and proposes to decode RS codes by 
the use of both the Fermat number transform [6,7] and 
the Euclidean algorithm. The Fermat number transform 
(FNT) eliminates polynomial multiplications and reduces 
the number of multiplications needed to reconstruct F'(x ) 
to nlog 2 n. The fast transform decoding scheme proposed 
in this article is faster than the decoding algorithm in [5]. 


II. Some Preliminaries on Finite Fields and 
the Fast Fermat Number Transform 


Let GF(q ) be a finite field, let n be a number that 
divides q — 1, and let 7 be a primitive nth root of unity. 
Define (fli)^ 1 to be a sequence of n elements from GF(q ). 
A discrete Fourier transform of this sequence of length n 
is defined by 


n — 1 

Ak = ^ cii~f ki mod q for (k = 0, 1, . . . , n — 1) (2a) 

i—o 


The inverse discrete Fourier transform of A \ t is defined 

by 


rn-l 


a,- = n 


-l 




Lk=0 


mod q for (i = 0, 1, . . . , n — 1) 

(2b) 


Given that GF(q) is a finite field, let GF(q)[x] be the 
ring of polynomials over GF(q). 

Definition 1 . The two polynomials m^x) and m 2 (x) 
over GF(q) are said to be relatively prime if and only if 
the greatest common multiple of mi(x) and m 2 (x) is a 
constant in GF(q). 


A direct computation of the transform in Eq. (2a) or 
its inverse transform in Eq. (2b) requires n(n — 1) multi- 
plications. 

When q is a Fermat prime, the Fermat number trans- 
form (FNT) over GF(q) can be used. A Fermat prime F m 
is defined by 


Definition 2. The two polynomials m^x) and m 2 (x) 
over GF(q) are said to be congruent modulo m(x), i.e., 
mi(x) = m 2 (x) mod m(x) if and only if m(x) divides 
mi(x) - m 2 (x). 


F m = 2 2m + 1 for (m = 1,2, 3,4) 
F m = 2 2 " 1 + 1 for (m = 1,2, 3, 4) 


The Chinese remainder theorem is presented here for 
convenience; proof can be found in [11]. Let M(x) = 
IE-i"* i(x) be a product of pairwise relatively prime poly- 
nomials. Let Ai(x), A 2 (x), . . . , A r (x) be any r polynomials 
such that deg[A,(x)] < deg[mj(x)],i = l,2,...,r. Finally, 
let tj(x) satisfy 

= 1 mod ruj(x) for (i = 1,2 ,..., r) 

m i x 

There then exists one and only one polynomial /(x) 
of GF(q)[x] of degree satisfying deg[/(x)] < d eg[M(x)], 
which uniquely solves the system of congruences: 

f(x) = A{(x) mod mj(x) 

The polynomial /(x) is given by 

f( x ) = E ( x ) mod ( ! ) 

rni{x) 


It is shown in [6,7] that integer 3 is a primitive 
l = 2 2m th root of unity in GF(F m ). Next, let n divide 
2 2 . Finally, suppose 7 is a primitive nth root of unity in 
GF(F m ) where 

7 = 3 ,/n 

The purpose of an FNT of length n is to compute ef- 
ficiently the transform sequence (^k^o using Eq. (2a). 
On the other hand, the inverse Fermat number transform 
(IFNT) of length n reconstructs the sequence (a*)?^ 1 from 
the sequence (^4k)jb=o v ^ a (2b). Since the order of 7 is 
a power of 2 in GF(i ? m ), the length of the sequence to be 
transformed is a power of 2. As a consequence, the very 
efficient FNT can then be used to yield a fast transform [6] 
which is analogous to the fast Fourier transform (FFT). In 
this case, the number of multiplications involved in evalu- 
ating such a transform sequence of length n is nlog 2 n [8]. 
A new type of Fermat number multiplier is developed in 
[9]. More details about the FNT can be found in [6]. 
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F( 7) = F(x) mod mi(x) } . . . , 


III. Nonsystematic RS Codes 

First, a set of RRP codes is defined. As shown next, 
these codes are constructed using the Chinese remainder 
theorem for polynomials over a finite field GF(q). Let 
m 0 (x), mi(x), . . and m n _i(£) be n relatively prime poly- 
nomials, and 

n — 1 

M(x) = JJ mi(x) 

i —0 

Assume that the degree of each m,(x) is d } and that 
kd information symbols u = (u 0 , «i, . . . , «jfcd-i) are repre- 
sented by the information polynomial as 

F(x) = uo -f- Uix + • ■ ■ + Ukd-\x kd 1 

where UieGF(q) and k < n. Then an RRP code is the 
residue representation of F(x) } that is, 

v_ — (yl 0 (a:),Ai(a:),...,j 4 n _i(i)) 

where A,(x) = F(x) mod rrii(x) and deg[Ai(x)] < d. By 
the Chinese remainder theorem, F(x) can be recaptured 
from Aj(x). The vector corresponding to the polynomial 
Aj(x) is named the ith symbol. A code vector v_ can correct 
error symbols less than or equal to t symbols if n — k > 2 t 
[ 3 , 4 ]. ' 

The following shows that RS codes are a subclass of 
RRP codes. In order to facilitate the fast encoding and de- 
coding procedures, which make use of the fast FNT meth- 
ods as described in Section II, the codeword length n is 
required to be a power of 2. 

Let 77i 0 (x), mi(ar), . . . ,m n _i(x) be n relatively prime 
polynomials given by 

m x {x) = x — 7 1 for (i = 0, 1, . . . , n — 1) 

Also let the k information symbols 

(u 0 ,Ui,... ,u k _ 1 ),u i eGF{q) 
be denoted by the information polynomial 

F(x) = wo + wix -f h Uk-\x k ~ l 

Then the equations 

F( 1 ) = F(x) mod mo(x) 


and 

F( 7 "- 1 ) = F(x) mod m n _i(x) 
lead to a code vector v represented by 

V = (^0,^1, • ■ • , A n _ t ) = . . , F{ t”" 1 )) 

The code vector y_ is a nonsystematic RS codeword. 
It is not difficult to see that u_ = (Aq> Ai, . . . , A n _i) is 
just the FNT of the sequence (uo,ui, . . . ,U£_i, 0 , . .. , 0 ) 
and the k information symbols (wq, «i, . . . , i.e., 

F(x) can be recaptured by an IFNT on the code vector 
IL — (Ao, Ai, . . , , 

On the other hand, since an RS code is a special case 
of an RRP code, the information polynomial F(x) can be 
recaptured also from v_ = (Ao, A \ , . . . , A n _ 1) by the use of 
the Chinese remainder theorem. Let 2»(x) denote a poly- 
nomial that satisfies 

^^ X \ ti(x) = 1 mod m,(x) for (i = 0 , 1 , . . . , n — 1 ) ( 3 ) 
77l, - (x ) 

where 

n— 1 n— 1 

M(x)=J[Mi(x)= n>-V) 

i=0 i= 0 

Then the information polynomial F(x) can be recon- 
structed as 

F(x) = V - - --t t (x)A ; - (mod M(x)) 

IV. Decoding RS Codes 

As Shiozaki et al. [ 1 , 5 ] point out, by using the Chi- 
nese remainder theorem together with the Euclidean algo- 
rithm, the RRP codes, which include the RS codes, can be 
decoded without solving the error-locator polynomial and 
the error-evaluator polynomial. However, that method has 
the disadvantage that the reconstruction of the corrupted 
information polynomial F'(x) from the received symbols 
involves n polynomial multiplications in GF(q ) followed 
by the operation modulo M(x). These operations can sig- 
nificantly lower the decoding speed. A modified decoding 
scheme, which makes use of the fast transform technique 
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to bypass the tedious polynomial multiplications and mod- 
ulo M(x) operation, is given in the Appendix. 

A. Decoding for Correcting Errors 

The overall decoding of nonsystematic RS codes for 
correcting errors using the Euclidean algorithm is summa- 
rized in the following (see the Appendix for details): 

1. Compute the IFNT of the received code word x/ = 

from Eq. (A-l) in the Appendix to 

obtain 

F'(x) = u' 0 + u[x + ■ ■ • + t4_iz n_1 

in Eq. (A-3). Next, calculate the degree of F'(x ). If 
degfF^a?)] < Jk, where k is the number of information 
symbols, then the information polynomial F(x) = F f (x)\ 
otherwise, go to step 2. 

2. To determine the error-locator polynomial D(x) in 
Eq. (A-5) and F'(x)D(x) ) apply the Euclidean algorithm 
to M(x) defined in Eq. (3) and F f (x). The initial val- 
ues of the Euclidean algorithm are Pi(z) = 0,p o (^) = 
1, r_i(x) = M(x) y and r 0 (x) = F'(x). The iterative 
procedure of the Euclidean algorithm terminates when 
deg[r*(x)] < n - [(d - 1)/2J where [x\ denotes the great- 
est integer less than or equal to x. 

3. Compute F(x) from Eq. (A- 14). 

A flowchart of a decoding algorithm to correct errors 
only is depicted in Fig, 1. An example of this decoding 
scheme is given in Example 1. 

B. Decoding to Correct Errors and Erasures 

Shiozaki [5] suggests a decoding scheme to correct 
both errors and erasures. This algorithm ignores the era- 
sure locations and uses the Chinese remainder theorem and 
the Euclidean algorithm to decode the shortened RS code- 
word. However, the shortened codeword loses the FFT 
structure; thus, a fast transform decoding scheme cannot 
be used. In this section, an improved decoding scheme 
is suggested which uses the fast-transform techniques dis- 
cussed in the previous sections to decode RS codewords 
with both errors and erasures. 

Suppose an RS codeword is transmitted through a 
noisy channel. Let there be s erasure symbols and t error 
symbols in the codeword such that 2t + s < n — k. Next, 
assume that the symbols at positions k\ y k 2} . . . , k 9 are era- 


sure symbols and that the symbols at positions £\, l 2) • * • i tt 
are error symbols. Finally, define 

5 

Di{x) = JJ(x - j kt ) (known) (4) 

1=1 

and 

t 

D 2 (x) = ” 7*0 (unknown) 

i= 1 

and 

D(x) = D 1 (x)D 2 {x) 

By an extension of the derivation of the key Eq. (A-9) 
given in the Appendix, the following key equation for both 
errors and erasures can be obtained: 

~M(x)B(x) + F\x)D l {x)D 2 {x) = F(x)Di(x)D 2 (x) 

( 5 ) 

where B(x) is as defined in Eq. (A-5) in the Appendix, 
deg [D 2 (x)] < [_(d-l-s)/2j, and deg[F(x)Di(x)D 2 (x)] < 
n — |_(d — 1 — s)/2j — 1. 

The Euclidean algorithm is an iterative procedure 
which can be used to find in Eq. (5) the greatest common 
divisor (GCD) of M(x) and F'(x)D\(x) [10]. An impor- 
tant intermediate relationship among the polynomials of 
the Euclidean algorithm is given in the equation 

F , (x)Di(x)p i (x) + M (x)si(x) = r,(x) (6a) 

and 

deg[p;(x)] + deg[rj(x)] < deg[M(x)] for - 1 < i < m 

(6b) 


where i is the iterative index and r m (:c) is the GCD of 
F'(x)Di(x) and M(x). The algorithm involves four se- 
quences of polynomials: s,(x), Pi(z), r *( x )> an d <&(*)■ The 
initial conditions are set in accordance with the following 
rules: 

1. For deg[F / (x)Di(x)] < deg[Af(x)], set s_ i(x) — 1, 
s 0 (x) = 0, P-i(x) = o, po{x) = 1, r_i(x) = M{x), and 
r 0 (x) = F’(x)D 1 {x). 
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2. For deg[F'(x)Di(x)] > deg[M(x)], set s_i(x) = 0, 
s 0 {x) = 1, pi(x) = 1, po(x) = 0, r-i(x) = F'(x)Di(x), 
and r 0 (x) = M(x). 

Since 2t 4 5 < n — k } 


mation polynomial F'(x) in a manner analogous to syn- 
drome computation in the conventional decoding schemes. 
However, this new algorithm does not require the extra 
steps needed to solve the error-locator and error-evaluator 
polynomials. 


deg[D 2 (x)] + deg[F(x)D!(x)D 2 (x)] = 2t + s + k-l<n 

= deg[M(x)] 


(7) 


Therefore, let 2t 4- s < n — k, u — L(c? — 1 — s)/2J , and 
v — n — |_(d — 1 — s)/2\ — 1 . By the proof of the theorem in 
the Appendix and Eqs. (5), (6a), (6b), and (7), there ex- 
ists a unique index j in the Euclidean algorithm such that 
D 2 (x) = A (x)pj(x) and F(x)Di(x)D 2 (x) = A(x)rj(x), 
where A(x) is some polynomial, deg[pj(x)] < [(d ~ 1 — s)/2J, 
and deg[r ; *(x)] < n — [(d — 1 — s)/ 2J. Thus, F(x) can be 
reconstructed as follows: 


F(x) 


rjjx) 

Pi(x)D x (x) 


( 8 ) 


The overall decoding of nonsystematic RS codes for 
correcting errors and erasures using the Euclidean algo- 
rithm is summarized in the following steps: 


L Use step 1. from the description of decoding for 
correcting errors. 


C. Examples of the Two Decoding Methods 

Example 1. Consider the Fermat prime F 3 = 17, and 
let k = 4. This is an (8,4) RS code over GF(l7) y which is 
capable of correcting two errors or less. It is shown in [6] 
that 7 = 2 is a primitive 8th root of unity. Also, for this 
case. 


i = 7 

M(x) ~ jQ(x — 7*) = x s — 1 

Let the four information symbols be u = (2, 3, 1,4). 
Then F(x) = 2 4- 3x 4- x 2 4 4x 3 . An FNT on the sequence 
(2, 3, 1, 4, 0,0,0, 0) yields the codeword £ = (10, 10, 14, 13, 

13.2.5.0) . Next, let the third and seventh symbols be er- 
roneous. Thus, e = (0,0, 5, 0, 0,0, 15, 0) and t/ = (10, 10, 2, 

13.13.2.3.0) , where i/, e, and i/ are as defined in the 
Appendix. After taking the IFNT on i/ y one obtains 
F'(x) = 13 4- 8x 4- 7x 2 + 16x 3 + llx 4 4 5x 5 4 6x 6 4 12x 7 . 
The Euclidean algorithm stops after the second iteration 
to yield r 2 (x) = 10x 5 4 llx 4 4 9x 3 4 16x 2 4 16x 4 5 and 
p 2 (x) ~ llx 2 4 11. Then F(x) is recaptured as 


2. Compute the erasure- locator polynomial D\(x) 
from Eq. (4). Next, compare deg[F / (x)Di(x)] with 
deg[M(x)]. If deg[F'(x)£)i(x)] < deg[M(x)], set P-i(x) = 
0 , Po(x) = 1 , r_x(x) = M(x) y and r 0 (x) = F , (x)D 1 (x)] 
otherwise, set p_i(x) = 1 , po(x) = 0, r_i(x) = 
F'{x)D\{x), and ro(x) = M(x). 

3. To determine the error-locator polynomial D 2 (x) 
and F'(x)D(x), apply the Euclidean algorithm to M(x) 
and F'(x)D 1 (x). The initial values of the Euclidean algo- 
rithm are defined in step 2; the iterative procedure of the 
algorithm terminates when deg|>t(x)] < n— [(d— 1 — s)/2j. 

4. Compute F(x) from Eq. (8). 

A flowchart of the decoding scheme for correcting both 
errors and erasures is given in Fig. 2. A depiction of this 
decoding scheme is presented in Example 2. 

This simpler, faster transform-decoding scheme using 
the FNT for RS codes is particularly suitable for pipeline 
VLSI implementation. The transform-decoding scheme 
utilizes an efficient FNT to compute the corrupted infor- 


F(x) = = 2 + 3a: + x 2 + 4x 3 

P 2W 

That is, u = (2, 3, 1,4). 

Example 2. Consider the same codeword v_ — 
(10, 10, 14, 13, 13, 2, 5, 0) given in Example 1. Let the first 
symbol be an error symbol, and the third and seventh sym- 
bols be erasure symbols. Thus, e = (1,0,5,0,0,0,15,0), 
and j/ = (11, 10, 2, 13, 13, 2, 3, 0). After the IFNT is taken 
of z/ , one obtains F'(x) = Il46x45x 2 4l4x 3 49x 4 43x 5 4 
4x 6 4 10x 7 . Since the erasure symbols are at the third and 
seventh positions, Di(x) = (x — 2 2 )(x — 2 6 ) = x 2 4l* Thus, 
F'(x)Dx(x) = 10x 9 44x 8 4l3x 7 4l3x 6 4l4x 4 43x 3 4l6x 2 4 
6x 4 11. The Euclidean algorithm stops after the second 
iteration to yield r 2 (x) = x 6 4 12x 5 4 10x 4 4 16x 3 4 4x 4 8 
and p 2 (x) — 13x 4 4. Then F(x) is recaptured as 


F(x) 


P2(x)Di(x) 


— 2 + 3x + x 2 + Ax 3 


That is, u = (2, 3, 1, 4). 
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V. Conclusions 

In this article, a fast transform decoding scheme is in- 
troduced which is particularly suitable for VLSI implemen- 
tation. This scheme first utilizes the highly efficient Fermat 
number transform to calculate the corrupted information 


polynomial F'(x). It then uses the Euclidean algorithm to 
evaluate the information polynomial F(x) directly, with- 
out going through the intermediate steps of solving the 
error-locator and error-evaluator polynomials. Thus, this 
fast-transform decoding scheme is faster and simpler than 
the decoding scheme in [1]. 


Acknowledgments 

The authors wish to thank Dr. Les Deutsch, Dr, Robert J. McEliece, 
Dr. Ed Satorius, and Dr. Laif Swanson at the Jet Propulsion Laboratory for their 
mathematical suggestions pertaining to the research which led to this article. 


References 

[1] A. Shiozaki and F. Nishida, “A New Decoding Method of Redundant Residue 
Polynomial Codes,” Bull Univ. of Osaka Prefecture , series A, vol. 24, no. 1, 
pp. 101-112, 1975. 

[2] R. Blahut, Theory and Practice of Error Control Codes, Reading, Mas- 
sachusetts: Addison- Wesley, 1984. 

[3] I. S. Reed and G. Solomon, “Polynomial Codes Over Certain Finite Fields,” 
SIAM, vol. 8, no. 2, pp. 300-304, 1960. 

[4] J. J. Stone, “Multiple-burst Error Correction with the Chinese Remainder The- 
orem,” SIAM , vol. 11, no. 1, pp. 74-81, 1963. 

[5] A. Shiozaki, “Decoding of Redundant Residue Polynomial Codes Using Euclid’s 
Algorithm,” IEEE Trans . Information Theory , vol. 34, no. 5, pp. 1351-1354, 
September 1988. 

[6] R. C. Agarwal and C.S. Burrows, “Fast Convolution Using Fermat Number 
Transforms with Applications to Digital Filtering,” IEEE Trans. Acoustics, 
Speech , and Signal Processing , vol. ASSP-22, no. 2, pp. 87-97, April 1974. 

[7] I. S. Reed, T. K. Truong, and L. R. Welch, “The Fast Decoding of Reed- 
Solomon Codes Using Fermat Transforms,” IEEE Trans . Information Theory , 
vol. IT-24, no. 4, pp. 497-499, July 1978. 

[8] A. Oppenheim and R. Schafer, Digital Signal Processing , New York: Prentice- 
Hall, 1975. 

[9] H. C. Shyu, T. K. Truong, I. S. Reed, I. S. Hsu, and J. J. Chang, “A New 
VLSI Complex Integer Multiplier Which Uses a Quadratic-Polynomial Residue 
System with Fermat Number,” IEEE Trans . Acoustics, Speech , and Signal Pro- 
cessing, vol. ASSP-35, no. 7, pp. 1076-1079, July 1987. 


135 


[10] R. McEliece, The Theory of Information and Coding , vol. 3 of The Encyclopedia 
of Mathematics and Its Applications , Reading, Massachusetts: Addison- Wesley, 
1977. 

[11] E. R, Berlekamp, Algebraic Coding Theory, New York: McGraw-Hill, 1968. 



ORIGINAL PAGE IS 
OF POOR QUALITY 



. 2^ = Qf (x)/y _ 1 (X) + r i (x) 
deglry <x)] < deglry. ^x)] 



Fig. 2. Flowchart of decoding procedure to correct errors and 
erasures. 





















Appendix 

Decoding RS Codes Using the Euclidean Algorithm 


Suppose the codeword v_ = (Ao, Ai, , . . t A n -x) is 
transmitted through a noisy channel. Assume that the 
symbols at positions an ^ it are in error. The 

received codeword 1 / is thus represented by 

V 1 = £+e = (A' 0 ,A' u ...,A' n _ l (A-l) 


and 


E{x) = 


t 


E 


M(x) 


U,( x ) e ti 


mod M(x) 


where e is the error vector defined by 


Let 


e = (0, . . . , e e , , 0, . . . , e tt , . . . , 0)V (A-2) 

Let (u{ |J u / ll ... ) uJ l _ 1 ) and (i» 0 ,t»i,...,w n _i) be the 
inverse transforms of v f and e respectively. Also let where 
F*{x) = u'q + u^x + * * * + wJj.jx" - 1 be defined as the cor- 
rupted information polynomial, and E(x) == wq + w\x + 

• • ■ -f iu n _ii n_1 be defined as the error polynomial. 


B{x) = 


y- D{x) 

u 


mod D(x) 


dw = 

•=i 


(A-4) 


(A-5) 


It is not difficult to see that the residue representations 
of F*(x) and E(x ), modulo m t (x) are Eqs. (A-l) and (A-2) 
respectively. That is, 1 / and e can be written, respectively, 
as 


is called the error- locator polynomial. The key equation of 
the decoding algorithm is derived from these relationships. 
First, let 


£' = (F , (l),F'(7) > ...,W 1 )) 

and 

e=(£(l),E( 7 ) > ...,W 1 )) 

From Section III, an RS codeword v_ is generated by 
an information polynomial F(x) via the following: 


A{x) = 


M{x) 

D{x) 


(A-6) 


t 


*'(*) = E 

t=i 


m ti (x) 


Ui(x)e t t 


K — (F(l), F(7 ), . . . , F(7 n-1 )) 


Then, by Eq. (A-4), one has 


Since j/ = v + e, one obtains F'{ 7*) = F( 7 1 ) -f E{ 7*) 
for 0 < i < n — 1. Thus, there are at least n values of x for 
which F'(x) and F(x) + E(x) are equal. It is obvious that 
deg[F(x)] < fc,deg[F'(x)] < n, and deg[F(x)] < n. Hence, 
by the fundamental theorem of algebra, 

F f (x) — F(x) + E(x) (A-3) 


E(x) = 


^±m tlXx)eii 


L mt ‘ 


mod M(x) 


[^(x).#'^)] mod A(x)D(x) (A-7) 


Since RS codes are a special case of RRP codes, it is 
shown in [5] that F'(x) and F(x) can also be reconstructed 
using the Chinese remainder theorem as follows: 


F\x) = 


E 


M(x) 

77lp(x) 


U(x)A' i 


mod M(x) 


Equation (A-7) can now be re-expressed as: 


E(x) — A(x) [A (x)D(x) + F'(x)] 

= A(x) [B\x) mod D{x)] (A-8) 
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where A(x) is some polynomial over the finite field. Using 
Eq. (A-5), a substitution of A(x) and B'(x) in Eq. (A-6) 
into Eq. (A-8) yields 

EM = (A-9) 

The proof of Eq. (A- 9) is similar to the proof given 
in [5]. A similar result of Eq. (A-9) is given by Blahut in 
theorem 9.1.1 of [2] using a spectral technique. The de- 
coder in Fig. 9.2 of [2] applies the Euclidean algorithm to 
M(x) and the 2 1 high-order coefficients of E(x) to deter- 
mine the error-locator polynomial D(x). Then a recursive 
extension is used to compute the rest of the coefficients of 
E(x) from the known D(x). Finally, the inverse transform 
over GF( 2 n ) of Ej is taken to recover the error pattern. 

The next paragraph describes how the decoder de- 
veloped in this article applies the Euclidean algorithm to 
the polynomials M{x) and F'(x) rather than to the usual 
M(x) and the syndrome polynomial 5(x), i.e., the 2 1 high- 
order coefficients of E(x). In other words, to determine 
polynomials D(x) and F(x)D(x)> this new decoder applies 
the Euclidean algorithm to M(x) and F\x). Thus, F(x) 
can be reconstructed from F(x) — F(x)D(x)/D(x). The 
advantage of this new decoder over the decoder developed 
in Fig. 9.3 of [2] is that both the recursive extension and 
the inverse transform can be replaced by a single polyno- 
mial division. 

By combining Eqs. (A-3) and (A-9), the key equation 
is obtained as follows: 

-M{x)B{x) + F , {x)D(x) = F(x)D(x) (A-10) 
where B(x) is defined as in Eq. (A-5). 

The Euclidean algorithm is an iterative procedure to 
find the greatest common divisor (GCD) of M(x) and 
F'(x) [10]. An important intermediate relationship among 
the polynomials of the Euclidean algorithm is given in the 
following: 

-M(x)si(x) + F'(x)pi(x) = r,(x) (A-ll) 

and 

deg [Pi (a)] + deg [r,(x)] < deg [M (x)] 

and 

for — 1 < i < m (A-12) 


where i is the iterative index, and r m (x) is the GCD of 
F'(x) and M(x). The algorithm involves four sequences 
of polynomials: s;(x), p;(x), rj(x), and g,(x). The initial 
conditions are: s_i(x) = 1, s 0 (x) — 0, P-i(x) — 0, Po(x) = 
1, r_i(x) = M(x), and r 0 (x) = F 7 (x); q-i(x) and qo(x) 
are not defined. 

The following lemma and theorem [10] show that the 
Euclidean algorithm can be applied to the key Eq. (A-10) 
to solve for the information polynomial F(x). 

Lemma. Given two non-negative integers p and v 
with v > deg[r m (x)] satisfying p + v = deg[M(x)] - 1, 
there exists a unique index j % 0 < j < m, such that 

deg[Pj(x)] < p 

and 

deg [rj (x)] < v 

For the proof, see [10]. 

Using the above lemma, the following theorem can be 
proved [10]: 

Theorem. Suppose p(x), s(x), and r(x) are nonzero 
polynomials satisfying 

— M(x)s(x) -F F'(x)p(x) = r(x) 

and 

deg \p(x)} + deg [r(x)] < deg [M(x)) 

There then exists a unique index j, 0 < j < m, and a 
polynomial A(x) such that 

p(x) = X(x)pj(x) 

and 

r(x) = A(x)rj(x) 

Now let n — k = 2 T, where T is the maximum number 
of errors in an RS code which can be corrected. If the 
number of errors t in a received RS codeword is less than 
or equal to T, then deg [D(x)] < T and deg [F(x)D(x)] < 
HT-Un-T-1. Thus, 

deg [D(x)] + deg [F(x)D(x)] < deg [M(x)] = n (A-13) 
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Thus, let n — k > 2t, u = T, and v — deg [M(x)] — 1 — 
fi = n-T—1. By the proof of the above theorem and Eqs, 
(A- 10), (A- 11), (A- 12), and (A- 13), there exists a unique 
index j in the Euclidean algorithm such that D(x) = 
A(x)pj(x) and F(x)D(x) = A(x)rj(x), where deg[pj(x)] < 


T and deg [r^(x)] < n — T - 1. Thus, F(x) can be recon- 
structed as: 


F(x) 


Pj ( x ) 


(A-14) 
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This article presents some results in terms of the performance improvement of 
a multi-feed array configuration over the usual single feed system when an adaptive 
least-squares algorithm is applied for the signal reconstruction. The article presents 
two novel versions of the least-squares algorithm , one of which is based on the max- 
imization of the signal-to-noise ratio while the other is based on the deconvolution 
of the received signal field. These algorithms have been developed for the purpose 
of minimizing degradations arising from various sources, which can severely limit 
the performance (gain) of a single- feed system. 


I. Introduction 

The development of multi-element array process- 
ing techniques has many potential applications for the 
Deep Space Network (DSN). These include signal recon- 
struction for both X-band and Ka-band communications 
[1-4] and electronic pointing to augment existing mechan- 
ical pointing techniques. For all of these applications, 
multi-element array processing can generally provide sig- 
nificant performance improvements over single-feed an- 
tenna configurations, which are predominantly used in the 
DSN. 

This article presents some results in terms of the per- 
formance improvement provided by a linear multi-feed ar- 
ray incorporating the proposed adaptive least-squares al- 
gorithm over a single-feed array system, as applied to the 
signal reconstruction problem. While the assumed linear 
array geometry is idealized, the results of this analysis pro- 


vide an indication of performance improvements that can 
be achieved with adaptive, multi-element array processing. 
This article describes two versions of the least-squares al- 
gorithm, one of which is based on the maximization of the 
signal-to-noise ratio while the other is based on deconvo- 
lution of the received signal field. Here, instead of trying 
to model the signal degradations in terms of deterministic 
equations in evaluating the performance of the algorithm, 
it is assumed that these degradations are “unknown” to 
the algorithm and vary with time. The algorithm tries to 
implicitly estimate these degradations in an adaptive man- 
ner from the samples of the noisy received signal. On the 
basis of these measurements, it computes a set of weights 
for combining signals at the outputs of various feeds in or- 
der to maximize the signal-to-noise ratio of the combined 
signal. 

For the purposes of illustrating the basic concepts in- 
volved with adaptive array processing, this article presents 
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the results for a 16-element linear-array feed system. The 
performance of the least-squares algorithm is to a first or- 
der determined by the signal-to-noise ratio of the received 
signal, the number of feeds in the configuration, and the 
time constants at which the received signal field is varying 
with time. 

In practice, it may be possible to simultaneously cor- 
rect for multiple degradations arising from different sources 
and having different time constants. These degradations 
may be induced by wind, gravitational loading, or antenna 
pointing errors. Simultaneous correction for such degrada- 
tions could be achieved by adjusting the time constants of 
the algorithm to track the fastest mode, in which case the 
slower modes would be estimated sub-optimally. Alterna- 
tively, one could track the most significant mode, thereby 
essentially ignoring the faster but less significant modes. 
More sophisticated techniques could also be used for sep- 
aration of the modes and tracking of them separately. 

II. Array Configuration 

The specific array configuration of interest in this ar- 
ticle corresponds to a linear-feed array distributed across 
the focal plane of an antenna. The array outputs are then 
fed to a parallel receiver bank as indicated in Fig. L As 
shown in the schematic diagram of Fig. 1, the signal out- 
puts from the feed elements are amplified by r f amplifiers. 
Assuming that all of the amplifiers have equal gain and 
noise temperature, the output of the ith amplifier can be 
written as 

ri(J) = Ai(t) cos (w c t + + rii(t ) (1) 

where A,*(t) and @i(t) are the signal amplitudes and phases, 
lo c denotes the signal carrier frequency, and n,-(t) is a zero- 
mean white Gaussian noise of one-sided spectral density 
No. The noise is also assumed to be spatially uncorrelated, 
i.e., E [ui(0 n j(0] = 0 f° r z i 1 J an d *> j = N. Under 

ideal conditions and assuming a plane- wave normally inci- 
dent on the antenna aperture, the amplitude of the center 
feed would be equal to y/2P ( P denotes the normalized 
power received by the antenna), while the remaining feeds 
will have nearly zero amplitude. However, array degrada- 
tions can disperse the signal amplitude (and phase) spa- 
tially over N feeds. These degradations can arise due to 
various sources, such as gravity, thermal fields, wind, and 
atmospheric turbulence. 

In the presence of such degradations, the adaptive sig- 
nal processing algorithm then combines the N feed outputs 
in a coherent manner so as to optimise some performance 


index, such as the signal-to-noise power ratio of the com- 
bined signal. This processing can occur either at r f or 
can be equivalently done at the baseband. Alternatively, 
in some possible imaging applications, it may be desired 
to reconstruct the complete focal plane field, i.e., obtain 
N output signals that are close to the outputs of the focal 
plane fields in the ideal antenna case. 1 Note that in the 
more general case, more than one feed may have signifi- 
cant amplitude if the source is not a point source and the 
antenna is capable of resolving such a composite source. 
In the limiting case, one may simply use the center out- 
put of the reconstructed field and ignore the others, thus 
achieving an alternative combination of the input signals. 

For the purposes of signal processing, the N received 
r f signals r*(tf); i = 1,2 are down-converted and 
quadrature sampled to obtain the sampled version of the 
complex baseband envelope of the r f signal r*(£) with 

ri(t) = Re{gi(t)e JWct } 

9i (t) = Ai(ty 9 ‘W + „<(*) (2) 

9i(t) = {ri(t) + 

In Eq. (2) above, r*(t) denotes the Hilbert transform 
of ri(t) } and Vi(t) is the complex envelope of the bandpass 
noise n,-(<). 

III. Signal Combining Via Adaptive 
Least-Squares Algorithm I 

As shown in Fig. 1, the adaptive algorithm deter- 
mines the time-varying complex-valued weighting coeffi- 
cients wi(k) ) on the basis of signal samples 

i — 1,2, N; and j = 1, 2, ..., k according to some appro- 
priate optimization criterion. The algorithm is adaptive 
in the sense that if the signal amplitudes and phases 
and remain relatively constant with time, then with 
increasing value of &, the algorithm achieves increasingly 
accurate estimates of these parameters, and the weighting 
coefficients converge asymptotically to their theoretically 
optimum values with an exponential convergence rate. On 
the other hand, if these parameters are time-varying, then 


1 V. Vilnrotter, a Ka-Band Array Signal Processing Progress 
Report,” JPL Interoffice Memorandum No. 331-88.5-047 (in- 
ternal document), Jet Propulsion Laboratory, Pasadena, Cal- 
ifornia, November 1988. 
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the algorithm tracks these variations and the weighting co- 
efficients are truly time-varying (there is no tendency for 
Wi(k ) to converge to some constant value). 

Denoting by w(k) and g(k) the weighting coefficient 
vector [uq (&)ii ?2 (&)••• w;v(A:)] / and the measurement vec- 
tor [gi(k)g 2 (k) . . . g^(lc)] / respectively, then the familiar 
least-square optimization criterion is to select w(k) so as 
to minimize the following index 


One may note that the entity to be inverted in Eq. (7) is 
only a scalar. Decomposing the sum in Eq. (5) as 

k - 1 

+i( k ) 

i=i 

and substituting Eq. (7) for _P(£), we obtain the following 
expression for w ELS (k). 


J k = Y, li-^WffCOI 2 

j = 1 


( 3 ) 




with respect to the weight vector w(k) for k = 1,2, .... In 
the above, the superscript H denotes conjugate transpose 
while i represents just the transpose of a matrix. The op- 
timal solution, termed least-squares estimate of w(k) } is 
given by (assuming k > N) 


™Ls( k ) = \ E£(i)£^(i) \ 


( 4 ) 


J= l 


j = i 


If the distortion process is time-varying, then it is more 
appropriate to replace the index J * by the one obtained 
by multiplying the summand in Eq. (3) by A*” 7 for some 
0 < A < 1, minimization of which yields the following ex- 
ponentially data- weighted least-squares estimate for w(k). 


™ELs( k )— 52 (®) 

Note that in the adaptive algorithm’s present non-recursive 
form, Eq. (4), it is required to invert an (N x N) matrix for 
every time instance k in the computation of w(k), which 
is somewhat computationally intensive. This problem can 
be overcome by replacing the estimate in Eq. (5) with its 
recursive form, which is obtained as follows. 

Denoting by P.(k) the matrix inverse in Eq. (5), then 
the matrix PT l {k) has the following update. 


- [A +£*(*)£(* -%(*)] 
x P(k - 1 )g_(k)g H (k)P(k - 1) 

x j +£(Jb) £ (fc) ( 8 ) 

By noting that the first term in Eq. (8) and the product 
of the last two factors in the second term both are equal 
to w.ELs( k ~ 1)> Eq. (8) may be rewritten as 

m.ELs( k ) ~ mELs( k ~ 1 ) + £i k )i_( k ) 

- [A + g H (k)P(k - l)£(fc)] 
x P(k - l)g(k)g H (k)w ELS (k - 1) (9) 

Post multiplying both sides of Eq. (7) by g(k) and with a 
simple algebraic manipulation, it follows that, 

[\+i H (k)p(k-i) l (k)y 1 p(k-i) l (k)=p(k) l (k) (io) 


With the substitution of Eq. (10) in the last term of 
Eq. (9), one obtains the recursive version of the algorithm 
given below. 

™ELs{ k ) = l^ELs( k ~ 1 ) + Ei k )9{ k ) 


P- 1 (k) = \p- 1 (k-l) + g(k)g H (ky, k = 1,2,... (6) 

Application of the matrix inversion lemma [5] to Eq. (6) 
yields the following desired recursion for P(fc). 

P(k) = A -*{£(* - 1) - [A +g H {k)P(k - l)£(fc)] -1 

x P{k - l)g(k)g H (k)P(k - 1)} (7) 


x [l-£ ff (fc)t£ EZ , s (fc- 1)] ( 11 ) 

P(k) = A - 1 j£(fc - 1) - [A + g H (k)P(k - %(*)] 

x£(fc-l)£(fc)£ H W£(fc-l)|; 
jfc = 0,l,2,... 
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If the complex field g(k ) is a wide-sense stationary pro- 
cess, a considerable simplification in computations may be 
achieved by replacing E.(k) with an appropriate constant 
matrix in the first recursion of Eq. (11) and dropping the 
second recursion. 

IV. Maximization of Signal-to-Noise Ratio 
via Modified Least-Squares Algorithm I 

In some applications, it may be more appropriate to 
maximize the signal-to-noise power ratio at the combiner 
output. The noise variance at the output of the combiner is 
equal to \™i\ 2 E[Wi | 2 ] = <x 2 ||l£|| 2 with <r 2 denoting the 
variance of the sampled version of the complex baseband 
process in Eq. (2). Thus, as shown in the Appendix, 
an effective maximization of the output signal-to-noise ra- 
tio can be achieved by minimization of the index given in 
Eq. (3) subject to the equality 

INI 2 = K (12) 

for some constant K , or by simply minimizing the following 
index, 

Y i 1 - hl h ( k )gU)\ 2 + twomi 2 - k) (i3) 

i=i 

with respect to w and ?(&), where j(k) is the Lagrangian 
multiplier. Differentiation of Eq. (13) with respect to w 
yields the following constrained least-squares estimate for 
w in terms of 7 (k) as 

£«*(*) = |e£0)£%) + tW/| pM) (14) 

Substituting Eq. (14) into Eq. (12) yields an equation for 
the unknown 7 (Jfc), which can be solved for 7 (k). Substi- 
tuting 7 (k) back in Eq. (14) provides the constrained op- 
timum solution for the weighting coefficient vector. Note, 
however, that there is no close-form solution for 7(6) and, 
thus, some numerical optimization techniques may need 
to be applied to obtain t 7 CL5 . A simplified solution is 
obtained by selecting some appropriate value for j(k) in 
Eq. (14) and then normalizing the estimate to have its 
norm equal to one. In an exponentially data-weighted ver- 
sion of Eq. (14), both the summands are multiplied by A fc ~ J 
where A is the exponential data-weighting coefficient. With 
these modifications, Eq. (14) has the following equivalent 
form. 


P~\k) = \P~\k - 1) + 7 0/ + £(%"(*) 

}P(k) --- A £(fc - 1) + g(k) 

iv(k) = P{k)xp(k) (15) 

™NLS = ®W/ll^( fc )ll 

7 (*) = 7o(l - A‘)/(l - A) 

Note that Eq. (15) requires a matrix inversion for each 
value of k for which S(fc) is desired. An approximate re- 
cursive form for Eq. (15), which does not require matrix 
inversion, may also be derived by applying the matrix in- 
version lemma. Note that in Eq. (15), the higher value 
of 7o results in the higher relative weight assigned to the 
noise variance at the combiner output. The initial values 
for P _1 and ip at k = 0 may simply be selected equal to 
zero. 

V. Least Squares Algorithm II 

In an alternative solution, it may be assumed that 
the received focal plane signal is the result of the spatial 
convolution of the ideal signal (in the absence of any distor- 
tion) with an unknown filter response representing various 
distortions from all sources, including antenna surface de- 
formations due to gravity, wind, antenna pointing errors, 
turbulence, etc., i.e., 

£(*)=2£2C(*)+i'(*) (16) 

where B_ is a Toeplitz matrix, X is the focal plane sig- 
nal vector that under appropriate sampling is equal to 
[0 . . . 010 ... 0] in the ideal case of plane- wave with no dis- 
tortion, and v_ is the additive noise vector. The matrix B_ 
includes any distortion effects, pointing errors, etc. For the 
linear array case under consideration, B_ can be approxi- 
mated by a circular matrix for large N or becomes identical 
to a circular matrix provided the vector X(fc) and the sig- 
nal vector g(k ) are zero-padded. Thus, it is assumed that 
B is circular. In the absence of noise, it is observed that 
X — B_~ l g where B _~ 1 is also a circular matrix. How- 
ever, B_ is unknown, and it needs to be estimated from the 
noisy observations. Or, more directly, B_~ l is estimated as 
follows. Rewriting the model Eq. (16) as 

X(k) = L m g(k) + v(k); k= 1,2 ,... (17) 

where F is an unknown circular matrix to be estimated 
from the given measurement g(k)\ k — 1, 2 . . . , n. Letting 
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f T = [fi , / 2 . . . /jy] denote the first row of the matrix F_ } 
and g ^(k) denote the vector obtained by cyclically shifting 
g(k) right t times, the following equivalent signal model is 
obtained. 


'o' 


£ T (*) ' 


vr 


t>i(*o 

0 









£?(*) 




vt(k) 

1 





+ 


0 







0 




Jn_ 


VN(k) _ 


(18) 

A least-squares estimate for / can be obtained from the 
signal model Eq. (18) and in its non-recursive form is given 
hy 

/m = {£ (e '«>) j (w) 

A recursive form for the estimate f(k) may also be derived 
following the steps used in obtaining Eq. (11)- Here re- 
cursion is over both the signal sample vector g_(k) and its 
circular shifts. With appropriate initial values P(0,0) and 
estimate /(0,0), one has the following recursion in terms 
of the indices k and j, with k denoting time and j denoting 
the cyclic shift of the received signal vector, 

-[A + £ f_i(fc)P(fc,j- 1 )£._i(fc)]~ l 

x£(*,.? - !)£,_! (%J_ i (*)£(*• J “ !)}; 

3 = 1,2,... 

P_(k + 1,0) = £(&, N)] k= 1,2,... 
f(k,j) = f(k,j - 1 ) +£(fc,j)£ i _ 1 (*) 

x [$ -gf _ ,(*)/(*, i - !)]; 

J = 1,2,... 

/(* + 1,0) = /(*,*); k = 1 , 2 ,... 

( 20 ) 


where 

tj = i, j = im\ 

= 0, j* [N/2\ 

where [x\ denotes the least integer greater than or equal 
to x for any real x. The circular spatial convolution of 
/* with the received signal g(k) yields the reconstructed 
signal vector h(k). In the case of perfect reconstruction 
(deconvolution), the central element of the vector h is the 
combined signal, while the remaining elements would be 
zero. 

VI. Simulations 

The performance of the least-squares algorithms of 
the previous section is presented here in terms of simu- 
lations. In the following simulations, a linear feed array 
is considered. For the purposes of these simulations, the 
received signal focal field is generated by spatially Fourier 
transforming a simulated linear aperture plane array. The 
signals in the simulated aperture are assumed to be of 
equal amplitude but with completely independent phase 
processes. Also, for the purposes of simulations, each of 
these phase processes is assumed to be a moving aver- 
age process of a specified correlation interval I\ and vari- 
ance (steady-state). The effectiveness of the different least- 
squares algorithms is measured in terms of the power ratio 
(in dB) of the reconstructed (combined) signal to the total 
received power (which would be concentrated in the cen- 
tral feed element under ideal conditions). In addition, the 
performance of the least-squares multi-element combining 
algorithm is compared against traditional single-element 
processing. It should be noted that for all of these sim- 
ulation experiments, the noise variance at the combiner 
output is matched to the noise variance at the output of 
a single feed (the weighting parameter vector w is normal- 
ized to have unit norm). 

In the simulations reported here, it is assumed that 
the received signal is unmodulated with the signal field 
amplitude equal to 1 in the aperture plane, corresponding 
to a signal amplitude Aq equal to 16 at the center feed in 
the focal plane (ideal case). In case of data modulation, a 
decision-directed approach may be used to remove the data 
modulation from the received signal. For the purposes of 
simulations, the sampling period is normalized to 1 sec. 
Thus, the variance <r 2 of the sampled complex envelope of 
the noise at any of the feed outputs used in the simulations 
is given by 

a 2 = Al/N 0 fs (21) 


145 


where f s is the sampling frequency; No is the one-sided 
noise spectral density; P = Aq/ 2 is the received-signal 
power; and the algorithm’s performance is plotted as a 
function of the carrier-to-noise spectral density ratio: CNR 
= (P/N 0 ) (in dB-Hz). 

In Fig. 2, the reconstructed signal amplitude and 
phase estimates as a function of time measured in num- 
ber of samples for the least-squares algorithm for a CNR 
of 10 dB-Hz are plotted. It may be observed from Fig. 2(a) 
that there is a considerable signal loss compared to the 
received signal amplitude equal to 16 for the ideal case. 
In the simulations, the initial estimate for the parameter 
vector w is selected to be [a a . . . a] with a = 1/16. As 
is apparent from Fig. 2(b), the phase-estimation error of 
the reconstructed signal is much smaller compared to the 
rms phase error of 1 rad introduced in the simulated re- 
ceived signal field. Figure 3 plots the corresponding results 
for the modified le cist- squares algorithm for the same set 
of signal parameters and for 7o = 100. Comparison of 
Figs. 2(a) and 3(a) shows a very significant performance 
improvement due to the proposed modification of the least- 
squares algorithm. Results are plotted in Fig. 4 for the case 
of 20 dB CNR with the parameter To equal to 200. For this 
case, the signal amplitude and phase of the reconstructed 
signal are quite close to their respective values for the ideal 
case. The signal amplitude loss for this case is only 1.09 dB 
relative to the ideal case, and the rms phase error (after 
adaptive combining) is 0.1 rad. 

A. Simulation Results for Least-Squares 
Algorithm I 

Figure 5(a) plots the sample estimates of the signal 
power loss (compared to the ideal case) for the standard 
least-squares algorithm I. The signal loss Pl is simply com- 
puted as Pi = 201og 10 (A rm */16) with 



where A% is the reconstructed signal amplitude at the ith 
sampling instance, and M is the number of sample values 
selected to be equal to 200 for these simulations. Fig- 
ure 5(b) plots the signal estimation rms phase error 0 rm * 
computed as 


where 0, is the phase of the combined signal at the *th 
sampling instance. It is apparent from these figures that 
although the least-squares algorithm is optimal with re- 
spect to the prediction error criterion, it is not satisfactory 
in terms of the signal-to-noise ratio of the combined sig- 
nal. For the case of simulated phase dynamics, there is an 
asymptotic signal loss of 4 dB for the high CNR (~ 20 dB) 
case. 

Figure 6 plots the performance of the modified least- 
squares algorithm with To = 100 and with the same set 
of signal parameters as for the case of Fig. 5. The results 
are computed for three different values of the weighting 
coefficient: A equal to 0.925, 0.95, and 0.975. Compari- 
son of Figs. 5(a) and 6 shows a dramatic improvement in 
performance due to the proposed modification. Thus, the 
asymptotic signal loss for this case and with A = 0.925 is 
only 1.3 dB compared to a 4 dB loss for the standard least- 
squares algorithm. Increasing the value of To or reducing 
A may further reduce the signal loss. 

It is not difficult to understand this marked perfor- 
mance difference between the two algorithms. By mini- 
mizing the sum of the norm square of the estimated sig- 
nal error and the noise variance of the combined signal 
output, the standard least-squares algorithm produces a 
relatively large noise variance at the output. This is so 
because at high CNR the contribution of noise to the to- 
tal error is relatively much smaller than the contribution 
due to signal error (measured as a fraction of total sig- 
nal), and, thus, the former is essentially ignored by the 
algorithm. In the constrained least-squares algorithm, by 
increasing the relative weighting attached to the noise vari- 
ance, the signal-to-noise ratio at the combiner output is 
increased. In fact, as is shown in the Appendix, whereas 
the standard least-squares algorithm effectively maximizes 
the signal plus noise power output (subject to a near- 
orthogonality constraint), the constrained algorithm ac- 
tually maximizes the signal-to-noise power ratio (subject 
to a similar near-orthogonality constraint) at the combiner 
output. 

The signal loss results presented in Figs. 6(a)-(c) can 
also be used to infer the performance gains provided by 
the multi-element least-squares technique over center-feed 
processing. In particular, we can define the array process- 
ing gain as the difference (in dB) between the combining 
losses of the least-squares and center- feed curves. These 
data are plotted in Fig. 6(d). 
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As seen in Fig. 6(d), the array processing gain is a 
function of both CNR and A. Best results occur for large 
CNR (> 15 dB) and low values of A (A = 0.925). In par- 
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ticular, a maximum array gain of 3 dB (Fig. 6(a)) can be 
achieved for this simulation example. For lower values of 
CNR (below 0 dB), the estimation variance inherent in 
the least-squares algorithm degrades the array processing 
gain to such an extent that center-feed processing actually 
provides better performance. 

However, it must be stressed that the simulation 
example considered here corresponds to a relatively 
high dynamic scenario, e.g., as compared with typical 
mechanically- induced array degradations. Lower dynamic 
scenarios permit longer time constants for the least-squares 
algorithm (A — » 1 and/or longer sampling period), thereby 
reducing the algorithm estimation variance. Thus, for low 
dynamic scenarios, positive array gains can be achieved 
over a wider range of CNRs (including CNR <C 0 dB) by 
utilizing time constants that are essentially matched to 
the dynamics. For sampling periods with T different from 
1 sec, the results of Fig. 6 are applicable with the CNRs 
scaled by T. For instance, with T = 10 sec (typical for 
slower processes), the normalized CNR of 0 dB in Fig. 6 
will correspond to the actual CNR of —10 dB. 

It should also be noted that for a given scenario, in- 
creasing A to such an extent that the algorithm cannot 
track the dynamics will lead to degraded system perfor- 
mance. This can be clearly seen from Fig. 6, where it 
is observed that the array gain for A = 0.975 is approx- 
imately 1 dB less than for A = 0.925. Finally, it can be 
observed from Fig. 7 that in contrast to the array gain re- 
sults, there is little difference in rms phase error between 
the least-squares and center-feed outputs. 

B. Simulation Results for Least-Squares Algorithm II 

Figures 8(a) and (b) plot the performance of least- 
squares algorithm II in terms of signal reconstruction. In 
these figures, the dashed graphs depict the amplified signal 
amplitude at the output of various feeds, while the solid- 
lined graphs represent the amplitude of the reconstructed 
field, i.e., the magnitude of various components of h(k ) 
obtained by the circular convolution of the weight vector 
f(k) obtained from Eq. (19) or Eq. (20), and the received 
signal vector g(k), for two different time indices equal to 
30 and 185, respectively. As for the case of least-squares 
algorithm I, it is assumed that the weight vector / is nor- 
malized to have its norm equal to 1. This ensures that the 
noise variance at various points of the reconstructed field 
is equal to the variance of the input noise field and, thus, 
the comparison in terms of signal amplitudes is equiva- 
lent to comparing the reconstructed signal-to-noise power 
ratio. The results in Fig. 8 correspond to the same set 


of signal parameters as for least-squares algorithm I and 
a CNR of 10 dB. As is apparent from Fig. 8, the least- 
squares algorithm II focuses most of the signal power that 
is originally dispersed in 16 taps into the center tap. A 
more appropriate measure of the effective focusing is ob- 
tained by the signal amplitude of the center tap of the re- 
constructed field, which is plotted versus time in Fig. 9(a). 
In this case, only -5.2 dB of the received signal power 
is scattered in the other taps for the reconstructed field. 
Figure 9(b) plots the phase error of the center tap signal 
and shows an rms phase error of 0.12 rad. Figure 10 plots 
the corresponding results for the case of To = 100 and dif- 
fers insignificantly from the corresponding results in Fig. 9. 
Thus, the least-squares algorithm II simultaneously opti- 
mizes the signal-to-noise ratio. Figure 11(a) plots the sig- 
nal loss in the center feed of the reconstructed signal for 
the least-squares algorithm II. As shown in the figure, with 
the parameter A = 0.925, a loss of 1.25 dB can be achieved 
for the high CNR case, which is similar to that obtained 
for modified least-squares algorithm I. Figure 11(b) plots 
the corresponding results for the rms phase error of the 
reconstructed signal, showing an rms phase error of about 
0.12 rad at CNRs higher than 10 dB. 

Figure 12 plots the results when the algorithm is ap- 
plied to a configuration of feeds connected to amplifiers 
with different noise figures. For this example, the case 
wherein one-half of the total number of amplifiers have 
6.0 dB higher noise temperature than others is considered. 
The CNR in the figure is still measured with reference to 
the amplifier with the lower noise temperature. As may be 
inferred from Fig. 12(a), the asymptotic signal loss (CNR 
> 10 dB) in this case is 2.2 dB as opposed to 1.2 dB for the 
case in which all of the amplifiers have low noise temper- 
ature, thus resulting in only 1 dB additional degradation. 
Note that if all the amplifiers were replaced by ones with 
higher noise temperature, then the degradation would be 
about 3 dB with reference to this lower CNR, thus result- 
ing in an effective loss of 9 dB. 

It may be remarked that in the above presentation, the 
sampling period T has been normalized to 1 sec, but the re- 
sults are also applicable to different sampling periods by a 
simple normalization. As is apparent from Eq. (21), while 
increasing the sampling rate by a factor K , one should cor- 
respondingly reduce the actual CNR by the same factor to 
obtain the algorithm’s performance for this case. 

VII. Conclusions 

From the simulations presented in the article, it can 
be observed that for the relatively fast distortion process 


147 



with a moderate dispersion and the array geometry con- 
sidered, the multi-element array configuration provides an 
improvement of about 3 dB over a single-feed system. For 
a slower process with possibly higher spatial dispersion, 


the improvement is expected to be higher and to a certain 
extent will also be influenced by a match between the ar- 
ray geometry and the pattern of the received signal power 
dispersion. 
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FOCAL PLANE FEEDS 



Fig. 1. RF signal combining In focal plane. 
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RECONSTRUCTED SIGNAL PHASE 



ELAPSED TIME 

F!g. 2. Least-squares algorithm I: (a) reconstructed signal amplitude; (b) reconstructed 

signal phase. 
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RECONSTRUCTED SIGNAL PHASE 



Fig. 3. Modified least-squares algorithm I: (a) reconstructed signal amplitude versus time; 
(b) reconstructed signal phase versus time. 
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RMS PHASE ERROR SIGNAL COMBINING LOSS, dB 




Fig. 5. Least-squares algorithm f: (a) signal combining loss versus 
CNR for reconstructed signal; (b) rms phase error versus CNR for 
reconstructed signal. 




SIGNAL COMBINING LOSS, dB SIGNAL COMBINING LOSS, dB 



Fig. 6. Signal combining loss versus CNR for reconstructed signal: (a) A = 0.925; (b) A = 0.95; (c) A = 0.975. Also (d) modified 

least-squares array gain versus CNR. 


154 






RMS PHASE ERROR 



Fig. 7. rms phase error versus CNR for reconstructed signal: 
(a) A = 0.925; (b) A = 0.95; (c) A = 0.975. 




Fig. 8. Focal plane reconstructed field amplitude, least-squares 
algorithm II: (a) after 30 samples; (b) after 185 samples. 
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Appendix 


The following shows that the modified least-squares 
algorithm I of Section IV achieves constrained maximiza- 
tion of the signal-to-noise ratio. In the first instance, the 
time averages are replaced by the ensemble averages. 

Denoting by $i(k) the signal component of the ith 
array element output, consider the problem of minimizing 

N 

H = E |5-^u),*s<| 2 (A-l) 

*=1 

with respect to W { , i = 1,2,... AT. Setting the partial 
derivative of H w.r.t.tu,* to zero yields 

E (s-Y, < S «J *’ =0 (A-2) 

Now with 5 = w H $ and s ^ [si, S 2 > • • • > s/sr] 7 , the index H 
may be written as 

E [|S - 5| 2 ] = E [|5| 2 + |5| 2 - SS* - S*S ] (A-3) 

At the optima] point, the following is obtained from 
Eq. (A-2). 

E [(5 - §)§•] = 0 (A-4) 

Adding the left-hand side of Eq. (A-4) and its complex 
conjugate to Eq. (A-3) yields the following form for the 
optimization index, subject to the constraint, Eq. (A-4). 

H = |S| 2 - E [|5| 2 ] (A-5) 

Thus, the algorithm that minimizes Eq. (A-l) also maxi- 
mizes E [|5| 2 ], subject to the constraint, Eq. (A-4), i.e,, 
it is also a signal maximization algorithm. There may 
also exist solutions that optimize E |5| 2 without the con- 
straint, Eq. (A-4), but these may result in large phase error 
with respect to 5, the desired signal, i.e., 5 and Se J ^ (for 
any random phase <f>) both have the same value of the index 


E [iS| 2 ] but only one of these would simultaneously min- 
imize the error function, Eq. (A-l). It may be remarked 
that there are, in general, an infinite number of solutions 
that satisfy Eq. (A-4), and in effect these orthogonalize the 
estimate 5 and the “error” (5 — 5). Among these solutions, 
the one maximizing E |j5| 2 j is selected. Equation (A-4) is 
termed the orthogonality constraint. 

The optimization index, Eq. (A-l), does not include 
the noise variance at the output of the array combiner, 
which is given by cr 2 ||u;|| 2 where u 2 is the variance of 
Vi(k) } the noise at the input of the combiner. Thus, now 
Eq. (A-l) is minimized subject to the constraint 

IMI 2 = K (A-6) 

for some constant K , Or, one can simply minimize 

E [|S - u/'s| 2 ] + /? (Ml 2 - K) (A-7) 

where /? is the Lagrangian multiplier. An analysis similar 
to derivation of Eq. (A-5) shows that with a constraint 
similar to Eq. (A-4), the index is given by 

|S| 2 - i?[|5| 2 J - 2j3I< (A-8) 

for some constants fi and Thus, again the algorithm 
maximizes E |5| 2 j subject to the constraint that the out- 
put noise variance is equal to a constant Acr 2 , and thus 
effectively maximizes the output signal-to-noise ratio. 

Now from the independence of the received signal $,• 
and noise i/, it follows that 

E [|S - w H l\ 2 ] = E [|S - w H s| 2 ] + IH V 2 (A-9) 

Minimization of Eq. (A-9) subject to the constraint, 
Eq. (A-6), is thus identical to the minimization of Eq. (A- 
1) subject to the constraint, Eq. (A-6), and thus effec- 
tively maximizes the signal-to-noise ratio under the near 
orthogonality constraint. Now for the large value of k } 
the index k~ l J* with J* given by Eq. (3) approaches the 
left-hand side of Eq. (A-9) under appropriate ergodicity as- 
sumptions, and the algorithm of Section IV thus achieves 
constrained optimization of the signal-to-noise power ratio. 
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It may be remarked that the least-squares algorithm 
in the absence of the constraint, Eq. (A-6), effectively min- 
imizes the following index 


|S| 2 -£ |S | 2 


- (T 2 \\w \\ 2 


(A-10) 


subject to a constraint similar to Eq. (A-4). Since the last 
term in Eq. (A-10) represents the noise power at the com- 
biner output, it can be observed that the standard least- 
squares algorithm effectively maximizes the sum of the sig- 
nal plus noise power rather than the signal- to- noise power 
ratio. 


160 



S/s-'B 


TDA Progress Report 42-99 


N90- 19449 


93 2-0 

<=fS. 


November 15, 1989 


The Effects of Sinusoidal Interference on the Second-Order 
Carrier Tracking Loop Preceded by a Bandpass 
Limiter in the Block IV Receiver 


C. J. Ruggier 

Telecommunications Systems Section 


Drop-lock relationships for the second-order phase-locked loop are derived when 
the carrier and a sinusoidal interference signal lie within the predetection filter 
bandwidth of the Block IV receiver. Limiter suppression factors are calculated 
when a bandpass hard limiter is used to maintain constant total power at the loop. 
The parameters of interest are the interference- to-signal power ratio (ISR), the input 
signal-to-noise power ratio (SNR), and the interference signal frequency offset from 
carrier Af, Limiter suppression caused by the combined effects of the noise and 
the interference signal accounts for the variability in the drop-lock threshold for 
given values of the input SNR and ISR parameters. This article goes beyond earlier 
published work that focused on the limiter’s effect on the drop-lock threshold; it 
accounts for the limiter action in the interference mode and provides an overall 
improvement in the prediction accuracy of the drop-lock model . 


I. Introduction 

One major application of the phase-locked loop in a 
DSN receiver is tracking the carrier of the received signals 
[1]. The receiver phase-locks to the carrier and loses lock 
when the carrier margin drops below the lock threshold, 
or when an interfering signal is received at the critical am- 
plitude and frequency offset from the carrier. Although 
telecommunications links are designed with sufficient mar- 
gins to ensure performance requirements for the lifetime of 
the mission, interference can occur at any time. If the in- 
terfering signal power and frequency exceed the threshold 
limit, the carrier tracking loop drops the weaker carrier sig- 


nal and locks up to the stronger interference signal. This 
jump phenomenon is due to the inherent nonlinearities 
present in the phase-locked loop for conditions when the 
interference- to- noise power ratio (INR) is sufficiently high. 
As the INR decreases, the signal-to-noise ratio (SNR) be- 
comes the dominant factor, which can cause the loop to 
lose lock when it decreases below the noise threshold level. 
This article investigates the effect of the bandpass limiter 
when drop-lock of this type occurs. 

The carrier tracking loop employed in the Block IV re- 
ceiver consists mainly of a second-order phase-locked loop 
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preceded by a bandpass limiter. The hard limiter provides 
constant power at the input to the loop and effectively 
minimizes the total mean-square error of the loop over a 
wide range of the SNR. If, in addition to the noise, a sinu- 
soidal interference signal is inserted into the limiter along 
with the carrier signal, the interference signal will also con- 
tribute to the limiter suppression. The limiter’s effect on 
the drop-lock threshold becomes evident from its impact 
on the loop gain and loop interference-to-signal power ra- 
tio (ISR) loop input. Limiter suppression factors for these 
parameters are calculated and incorporated into the ba- 
sic drop-lock model to improve its prediction accuracy for 
large variations in the loop SNR. 


where p represents the operator d/dt , F(p) is the loop fil- 
ter, and K (1/sec) is the open-loop gain, which includes the 
VCO and the phase-detector loop gain. This nonlinear dif- 
ferential equation cannot be solved analytically; however, 
using the method of perturbations, solutions with best- 
approximation trial functions can be obtained. A steady- 
state trial solution is assumed to be 

<po(t) = A 4 0sin(Au>f 4 v) 

where A represents the static phase error, 0 is the phase 
deviation, and v is the phase shift. Bruno [2] derived the 
lock constraints as 


II. Carrier Drop-Lock Model 

Figure 1 shows a representative second-order phase- 
locked loop preceded by a bandpass limiter. Bruno [2] 
derived the loop equations for the case of a strong sig- 
nal and a sinusoidal interferer, without the limiter. The 
volt age- controlled oscillator (VCO) output is equal to 

2 COS | LOit 4 <^>o(0] 

where w\ is the VCO frequency (rad/sec) and <f>o(t) is the 
phase modulation due to the input through the loop action. 
The phase detector is assumed to be a perfect multiplier, 
and the loop filter has a transfer characteristic described 
as F(s). 


sin A = 


— 9 2 6 cos xp 

2J O (0) 


sin(A — v) — 


—6 2 b cos xp 
2v1SR h{6) 


( 2 ) 


66 sin ip 4- 2 J\(9) cos A 

WT^W) 


6 2 6 cos t p 

w 


= ISR 


( 3 ) 


where Jo? J i> and J 2 are Bessel functions of the first kind, 
xp is the phase angle of F(s), and 6 is the normalized offset 
frequency 


Aw 

WW\ 


Ignoring the effects of narrow-band Gaussian noise, 
the input to the loop consists of the sum of two sinusoidal 
components: 


A sin(w c t) 4 B sin(u> c 4 A w)t (1) 

where the first term of Eq. (1) is the wanted signal com- 
ponent with frequency w c having constant amplitude A 
volts when the limiter reaches a constant output. The in- 
terference component has an amplitude equal to B volts 
and is offset in frequency from th e sig nal component by an 
amount equal to Aw. Defining \/ISR as B/A , Eq. (1) can 
be rewritten as 


where s = j Aw, and K = the open-loop gain. 

Restricting the second-order loop filter with transfer 
characteristics, 


F(a) = 


1 + r 2 s 

1 + T\8 


where t, > r 2 . With Aw ;» l/r 2 , one obtains the reason- 
able approximation 


\F(s)\ 



for ip « 0 


A sin(u><^) 4 VlSR A sin(u> c t 4 Au>)2 
The output modulation (po(t) is given by 


The loop is expected to drop lock when the static phase 
error approaches 90 deg. Applying this condition and using 
Eqs. (2) and (3) with the condition that J\{6) « 9/2 and 
J o (0) & 1 for small phase deviations, the critical ISR can 
be given as 


<po(t) = - sin <po(t) 4 ISR sin (^Awt - </> o (0) 


(ISR) e = 


AttA f 
K (r 2 /ri) 
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This describes the drop-lock threshold for critical values of 
ISR and offset frequency A / without the limiter action. 


III. Calculation of the Limiter 
Suppression Factors 

The effect of the bandpass limiter also needs to be 
taken into account, when the carrier, interfering sinusoidal 
signal, and narrow-band Gaussian noise are present at the 
input to the limiter. Jones [3] calculated the autocorrela- 
tion function of the ideal hard limiter under similar con- 
ditions. The interaction of the two signals s \ and s 2 with 
noise generates a filtered output with autocorrelation func- 
tion given by 


r ( t )= E E E 2 

i = — oo t~ — oo fc=|i|, |*| + 2 


b 2 k\e\ / 

• — 1*+ 11 

p+l»-p 

V 2 J 



P\r) 


x cos 



| i + l|w 2 + ^(^2 — T 


where and u> 2 represent the frequencies of si and s 2 re- 
spectively, u c is the bandpass filter center frequency, and 
pk is the normalized noise-power envelope function con- 
taining both discrete (due to the period terms) and contin- 
uous components (associated with the output noise). The 
total power contained in these discrete components at the 
output is then given by 


°° r 

E 26§|/||/-l|co8[/(wa-«0-«3 


l = — OO 


and the output signal powers are given as 

(si)o = 2&010 = ^2 (77) 
and 

(s 2 )o = 26 001 = ^2 (^7) 


where T and 2 F\ are the gamma and confluent hypergeo- 
metric function, respectively. For the case where both the 
carrier and interference power are much greater than the 
noise power, the convergence properties of these equations 
become unstable. Then it becomes necessary to use the 
asymptotic forms 


and 



(*2)0 = 



I I.o. £1 
2’ 2’ ’ s 2 


2 


f £1 \ [£(1/2) 

W h( 3 / 2 ) 


2F1 



2' s 2 )\ 


for 


f2 

N 


00 ; — < 1; fc = 0 
si 


These relationships can be used to calculate the limiter 
suppression on the carrier power and the power ratio of 
the interference and carrier signal. For the case where 
interference is not present, the limiter suppression factor 
reduces to that calculated by Davenport [4] for a sinusoid 
and noise only. With interference present, the limiter sup- 
pression becomes affected by changes in both the ISR and 
SNR power ratios. Limiter suppression of the carrier sig- 
nal from the limited strong signal peak level, which results 
in a corresponding suppression of the loop gain, is given 

by 



Alternately, the limiter suppression of the output ISR with 
respect to the input ISR becomes 

(ISR)o 

(ISR),- 

Together these factors allow adjustment of the model’s 
critical values of ISR and frequency offset, at which car- 
rier drop-lock occurs. The combined effect of the limiter 
can now be given as the product of these two ratios. The 
threshold limiter suppression product is defined as 

_ rjL ] 1/2 ( ISR )o 
* “ 187T 2 J (ISR); 

showing that St is the suppression of loop gain times the 
suppression of interference-to-signal power ratio. Figure 2 
illustrates the threshold limiter suppression product for 
varying carrier margin values. 
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IV. Effect of the Bandpass Limiter 
on the Drop-Lock Model 

The limiter suppression product can now be used to 
determine the limiting effect caused mainly by the interfer- 
ence signal. The basic drop-lock equation can be written 
as 


ISR = 


4ttA / 
S t K(T 2 /n) 


Figures 3 through 8 show the critical ISR and frequency 
offset for the possible tracking loop modes of the Block IV 
receiver, with values of input SNR necessary to cause car- 
rier drop-lock. Table 1 lists the various loop mode parame- 
ters used to generate the curves. Note that as the interfer- 
ence power increases, the limiter suppression approaches 
a constant level. The slope of the drop-lock equation is 
affected only slightly by the increasing interference power 
for high SNR, and unaffected for low SNR. This observa- 
tion corroborates well with the trend shown in the mea- 
sured data and indicates that the limiter action, overall, 
produces a linear translation of the drop-lock threshold. 


V. Experimental Validation 

Drop-lock threshold tests were conducted indepen- 
dently at the Telecommunications Development Lab 
(TDL) and at the Compatibility Test Area (CTA-21). The 
purpose of the tests was to validate the drop-lock model 
with the Block IV receiver under the conditions of inter- 
ference described above. Only one tracking loop mode was 
tested for 2 Blo = 10 Hz. Figures 9 and 10 show the 
comparison of the measured data obtained from TDL and 
CTA-21, respectively, to the theoretical curves for varying 
levels of SNR at the limiter input. The measurements were 
restricted to frequency offsets from 10 to 1000 Hz. Initial 
ISR power ratio was set 5 dB higher than the loop thresh- 
old SNR {Cf*lBioNo ), where C is the carrier power in 
watts, 2 Blo is the two-sided loop threshold noise band- 


width in Hz, and No is the noise-power density in watts 
per Hz. 


VI. Conclusions 

It has been shown that when a bandpass hard limiter 
precedes the carrier tracking loop, limiter effects on the 
drop-lock threshold can be calculated for variations in the 
input SNR and ISR power ratios. Limiter suppression of 
the loop gain and ISR is evidenced in the responses of the 
drop-lock threshold for strong input signal variations, with 
varying combinations of levels in the input SNR and ISR. 
However, as the input ISR increases, the limiter suppres- 
sion stabilizes, exhibiting less sensitivity to varying levels 
in the input SNR. 

Drop-lock calculations for the possible loop modes of 
the Block IV receiver indicate that the loop becomes more 
susceptible to interference as the loop gain and loop band- 
width increase. The predetection filter noise bandwidth 
is also a factor. For example, the filter provides no at- 
tenuation of the interference signal when its frequency lies 
within the filter passband; for this particular case, the nar- 
row bandwidth modes only tend to reduce the drop-lock 
threshold. For larger interference frequency offsets outside 
the predetection filter passband, the calculations become 
more conservative due to the approximation of the filter’s 
transfer characteristics in the basic loop model. 

Experimental data from tests conducted at TDL and 
the CTA-21 facility show good agreement with the theo- 
retical calculations. The model shows a tendency to un- 
derpredict at the low SNR values, which is inherent in the 
assumption of negligible noise in the derivation of the ba- 
sic model. On the other hand, the model overpredicts for 
higher SNR values between 100- and 1000-Hz frequency 
offsets, attributable largely to the limiter model which was 
assumed to be ideal. The model shows the best agreement 
with the measured data (within 1 dB) for smaller frequency 
offsets and loop SNR of 10 dB or greater. 
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Table 1. Block IV receiver tracking loop modes 3 


2 BiOy 
Hz 

Mode 

K 

Open-loop gain, 
1 / sec 

12. 

n 

BW, kHz 
Predetection 
Filter 

1 

Narrow 

9.6089 x 10 5 

4.4340 x 10~ 5 

0.200 

3 

Narrow 

9.6009 X 10 5 

7.7057 X 10~ s 

0.200 

10 

Narrow 

9.6089 X 10 5 

4.4343 X 10“ 4 

2.0 

10 

Wide 

9.6089 X 10 5 

4.4343 X 10~ 5 

2.0 

30 

Wide 

9.6009 X 10 6 

7.7057 X 10“ 5 

2.0 

100 

Wide 

9.6089 X 10 6 

4.4343 X 10" 4 

20.0 

300 

Wide 

9.6009 X 10 6 

7.7065 X 10“ 4 

20.0 


a Prom “Receiver/Exciter Block IV Equipment, Subsystem Detail 
Specifications,” Doc. ES 505736, Rev. A (internal document), Jet 
Propulsion Laboratory, Pasadena, California, October 1974. 
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Fig. 1. Carrier tracking loop drop-lock model. 
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Fig. 2. Threshold limiter suppression product for various carrier 
margin values. 



Fig. 3. Critical ISR versus frequency offset with input SNR values 
causing carrier drop-lock, fracking loop Mode 1, 2B LO — 1 Hz. 
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Fig. 4. Critical ISR versus frequency offset with Input SNR values 
causing carrier drop-lock, tracking loop Mode 2, 2B LO — 3 Hz. 
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Fig. 5. Critical ISR versus frequency offset with Input SNR values Fig. 7. Critical ISR versus frequency offset with Input SNR values 
causing carrier drop-lock, tracking loop Mode 3, 2B L0 = 10 Hz. causing carrier drop-lock, tracking loop Mode 5, 2B L0 = 100 Hz. 



Fig. 6. Critical ISR versus frequency offset with input SNR values 
causing carrier drop-lock, tracking loop Mode 4, 2B LO = 30 Hz. 



Fig. 8. Critical ISR versus frequency offset with input SNR values 
causing carrier drop-lock, tracking loop Mode 6, 2B LO = 300 Hz. 


168 






10 log 10 (//a dB 10 log 10 {HO, dB 



Fig. 9. Comparison of TDL data to theoretical levels of SNR at the limiter input, tracking 

loop Mode 3, 2B LO ~ 10 Hz. 



Fig. 10. Comparison of CTA-21 data to theoretical levels of SNR at the limiter Input, tracking 

loop Mode 3, 2B LO = 10 Hz. 
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NASA/JPL 70-Meter Antenna Axis Servos 
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Analytic methods for evaluating pointing errors caused by external disturbance 
torques are developed and applied to determine the effects of representative values 
of wind and friction torques. The expressions relating pointing errors to disturbance 
torques are shown to be strongly dependent upon the state estimator parameters, 
as well as upon the state feedback gain and the flow-versus-pressure characteristics 
of the hydraulic system. Under certain conditions , when control is derived from an 
uncorrected estimate of integral position error , the desired Type II servo properties 
are not realized and finite steady-state position errors result. Methods for reducing 
these errors to negligible proportions through the proper selection of control gain 
and estimator correction parameters are demonstrated. The steady-state error pro- 
duced by a disturbance torque is found to be directly proportional to the hydraulic 
internal leakage. This property can be exploited to provide a convenient method of 
determining system leakage from held measurements of estimator error , axis rate , 
and hydraulic differential pressure. 


I. Introduction 

Recent studies of mechanisms contributing to limit- 
cycle behavior have led to a need for more accurate mod- 
eling of the disturbance torque response characteristics of 
the 70-m antenna axis servos. Of particular interest in 
the limit-cycle studies is the transient behavior of the var- 
ious plant and estimator states that occur during friction- 
induced limit cycling. The traditional assumption that the 
estimator states accurately track those of the plant be- 
comes inaccurate when the plant is subjected to external 
disturbance torques. This shortcoming necessitated the 
development of the present multivariable axis servo model 
where the plant and estimator states are distinct. 

Precise pointing of the 70-m antenna is accomplished 
through the use of a two-axis, azimuth-elevation, bullgear- 


driven servo system. Control torques are produced by 
fixed-displacement, axial-piston hydraulic motors, which 
are coupled to the axis bullgears through spur-gear reduc- 
ers. Four such motor/gear reducers are employed for each 
axis. Backlash is eliminated by separate countertorque mo- 
tors which apply a constant torque bias to the output pin- 
ion of each control motor. The hydraulic connections to 
the countertorque motors are arranged as shown in Fig, 1 
so as to preload all four gear reducers and apply zero net 
torque to the bullgear. Torque modulation is accomplished 
by four port-hydraulic servo valves. Servo control con- 
sists of a hardware rate loop with tachometer feedback and 
a computer-based position servo employing state-variable 
feedback. 

The servo model is based on the nonlinear orifice-flow 
equation of the valve, along with the motor-flow equation 
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as described in [1]. A piecewise linear representation of 
the valve is obtained by partial differentiation of the valve 
equation (Eq. 5 of [1]) to yield the flow Q v as a linear 
function of the valve input current I v and load pressure 

Pl : 

Qv = K P I V - D h P L (1) 

with 

K P = K v P l J 2 



and 

P v = Ps-Pl- Pr 

where Ps and Pr are the regulated system supply pres- 
sure and return pressure, respectively. As shown later, the 
load pressure is sufficiently small relative to the supply 
pressure such that the range of the valve pressure remains 
within a 2 to 1 ratio. This justifies the use of a constant 
value approximation for the flow gain Kp for control dy- 
namic analysis. In contrast, the equivalent damping Dp 
is linearly proportional to the flow, which varies in pro- 
portion to antenna rate. Therefore, Dp varies between a 
minimum value corresponding to leakage flow at the zero 
antenna rate, and a maximum corresponding to the maxi- 
mum tracking rate of the antenna. 

The hydromechanical system model incorporates the 
linearized valve of Eq. (1) along with a motor coupled to a 
rigid-body inertia load representation of the antenna struc- 
ture, as shown in block diagram form in Fig. 2, where Cr , 


Jm , and Vm represent the hydraulic compressibility, motor 
inertia, and motor displacement, respectively. 

The block diagram of the equivalent plant (servo-loop 
hardware) in Fig. 3 incorporates the model of Fig. 2, the 
tachometer feedback and control amplifier with associated 
compensation networks, and two additional integrations 
which produce the angular position and position integral 
states. The two inputs represent the electrical rate com- 
mand input to the plant and an equivalent external dis- 
turbance torque; I\r represents a constant with value pro- 
portional to the rate loop gain, and P*x and Z 2 are the pole 
and zero frequencies of the rate loop compensation net- 
work. Figure 3 also includes a simplified equivalent of the 
tachometer feedback network obtained by neglecting the 
network pole, which is at a relatively high frequency. This 
approximation introduces the acceleration feedback branch 
shown in Fig. 3 where the parameter Z\ corresponds to the 
negative real frequency of the network zero. 

To provide additional insight into the effects of rate 
and position loop parameters on system compliance, com- 
pliance equations are developed separately for the open- 
position loop case, for the hardware position proportional, 
integral, and derivative (PID) feedback, and for the closed- 
loop state variable controller. It is shown that the combi- 
nation of plant-state and estimator-state feedback in the 
precision mode leads to compliance characteristics different 
from those of the computer mode, even when both modes 
have identical plant and estimator dynamics. 

II. Compliance Equations for the Open 
Position Loop Case 

The open position loop compliance can be derived 
from the angular position response to a unit-step torque 
input. By application of the Mason transmittance rule to 
the block diagram of Fig. 3, the compliance transfer func- 
tion becomes: 


<3m ____ (1/Ja/s 2 )(1 + Dr/Crs)( 1 4- Pifs) 

Tx (1 + p2/s)(l + Dr/Crs + Kr/Ch Jm Z\s 4- (Vjf f + Kr)/ JmCrs 2 ) + (^2 — P 2)(A r/ Cr J m s 2 )(1 / 5 + l/^i) 


which leads to 


0 A/ (l/s)(s + Dr/Cr){s + Pi) 

T X ~ Jm(s + P 2 )(s 2 + (Dr/Cr)s + V*/J m Cr) + (Kr/CrZ^s + Z^s + Z 2 ) 


171 


Application of the final value theorem to Eq. (2) in- 
dicates a constant steady-state rate in response to a step 
function torque disturbance. The steady-state compliance 
is given by 

6m _ A# 

T x ~ s(V^ + K r Z 2 /P 2 ) 


III. Compliance Equations for Hardware PID 
Feedback 

The compliance of the closed-loop PID feedback sys- 
tem can be calculated by the same method used earlier for 
the open-loop case. The addition of position, integral, and 
rate feedback branches produces three additional denomi- 
nator terms in the compliance expressions. Thus, 


0m = (1/J m s 2 )(1 + D h /Chs)(1 + Pa/s) 

T x (1 + P 2 /s)( 1 + Dh/Chs + Kr/ChJmZ x s + {V& + K r )/J m C hS 2 ) 

+ (Z a - P2 )(K r /C h J m s 2 )(1/s + 1 jZ x + K x /s 3 + K 2 /s 2 + K a /s) 

+ (1 + P 2 /s)(Kr/C„J m s)(K 1 /s 3 + Ka/s 2 + I< 3 /s) 


which leads to 


©m _ s(s + Dh/Ch)(s + P2) 

Tx ~ Jms 2 (s + P 2 )(s 2 + (D H /C n )s + V*/J M C„) 

+(Kr/C h )(s + Z 2 )(s 2 + s^/Zx + K x + Kas + A' 3 5 2 ) 


( 3 ) 


and the steady-state compliance becomes 

©m _ Dhs 
T x “ KiKr(Z 2 /P 2 ) 


The compliance transfer functions of Eqs. (2) and (3) in- 
clude numerator zeros corresponding to the network pole 
P 2 and to the effective hydraulic damping Dh/Ch. The 
denominators are seen to contain the poles of the respec- 
tive open/closed-loop system. From the dependence of Du 
on hydraulic flow, the steady-state compliance properties 
are seen to be antenna-rate dependent. For typical closed- 
position loop parameter values, the damping Dr has rel- 
atively little efFect on the closed-loop poles. Therefore, 
since the shape of the compliance transient is determined 
by the locations of the transfer function zeros relative to 


the poles, the closed-loop transient properties are also seen 
to vary as a function of D//. 

IV. Compliance Equations for the State Vari- 
able Controller 

Earlier methods of disturbance torque effects analy- 
sis [2] were based on the assumption of negligible errors 
in the estimator states in relation to the corresponding 
plant states. This permitted a simplification of the sys- 
tem model by substituting plant-state feedback in place of 
the estimator-state feedback, thereby reducing the number 
of states required in the model. This assumption, widely 
used in evaluating command input transient responses, was 
found to produce finite errors in the determination of dis- 
turbance transient responses of the axis servos. It was 
subsequently replaced by a superior method employing full 
modeling of the estimator states as well as the plant states. 
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x = Ax + B?7 


The model for disturbance torque response of the 
state-variable axis servo controller is based on a linearized 
multi-input state- variable representation of the system of 
Fig. 3, with the addition of the state-estimator and control- 
feedback gain. The plant state is represented by the gen- 
eralized state equations, where x and Y are the state and 
output by 


Y = Cx + T>U 

respectively. 

The corresponding A, B, C, D matrices from Fig. 3 

are: 


A = 


0 10 0 
0 0 1 0 
0 0 0 1 Jm 

0 0 —Vm/Ch - Kr/Ch —Dh/Ch — Kr/CrJmZi 
0 0 —1 —1/JmZi 


0 

0 

0 

Kr(Z 2 - P 2 )/C h 


-Pa 


B = 


0 

0 

0 


0 

0 

1 / J M 


Kr/Cr — Kr/CrJmZi 


1 — 1/JmZi 


c = 


0 


10 0 0 


D = 


0 0 
0 0 
0 0 
0 0 
0 0 


Representing the estimator state by x, and estimator out- 
put by Yj the estimator equations become: 

x = A£-hB l U 1 + L(Y-Y) 

Y = Cx + D 1 Ui 

where 2?i, B 2) D 2) U i, and U 2 represent the first and 

second columns of B and D and the first and second ele- 
ments of U, respectively. This distinction is essential be- 
cause both inputs couple directly into the plant, while only 
the rate command input U\ couples directly into the esti- 
mator. 


V. Computer Control Mode 

Because the rate command U\ is formed differently for 
the precision mode, the two control modes are addressed 
separately. Incorporating the control- feedback gain K into 
the expression for the rate command input XJ\ for computer 
mode, and substituting into the plant and estimator equa- 
tions leads to 

Ui = -Kx 

x — Ax — B\Kx + B 2 U 2 (4) 

x = LCx + (A - B X K -LC- LDi K)x + LD 2 U 2 


With the estimator error, 


prising the plant and estimator states leads to the matrix 
differential equation form 


X — X — X 

i = (A — LC)x + LD 1 I<x + (B 2 - LD 2 )U 2 



A 

-BiK 


X 

+ 

' b 2 

LC 

K-BiK-LC 


X 


0 


U 2 (7) 


Since for the present case both D\ and Do are zero, the 
estimator and estimator-error equations simplify to 

x = LCx + (A - BiK - LC)x (5) 

and 

x = (A - LC)x + B 2 U 2 (6) 


which is compatible with existing linear system analysis 
and simulation software tools. The steady-state distur- 
bance torque compliance properties of the combined plant 
and estimator system can be determined from the steady- 
state solution of Eq. (7). The general expressions for the 
steady-state values of the individual plant and estimator 
states can be determined by a symbolic expansion of the 
determinants resulting from application of Simpson’s Rule 
to Eq. (7). 


The equations for the plant and estimator are illus- 
trated in the state-space block diagram in Fig. 4. Com- 
bining Eqs. (4) and (5) to form a single state vector com- 


Since, in the steady state, the derivatives represented 
by the left-hand side of Eq. (7) equal zero, steady-state 
position estimate £2 becomes 


• det 


U 2 


-Biki 


1 

1 

Ax | 

LC | 

I 

-Biki-LCi | 

1 



Bo 


B 1 


&3 & 4&5 


( 8 ) 


0 I A3 5 — B\ &5 

0 i 
0 I 


det 


A -BiK 
LC A -BxK-LC 


where the numerator matrix is obtained from the 10 x 10 
denominator matrix by replacing the column correspond- 
ing to £2 (column 7) with the right-hand-side vector, in 
accordance with Simpson’s Rule. The subscripts applied 
to capital-letter symbols designate the respective columns 
of the associated matrix, and As } $ denotes a matrix com- 
prised of the third through fifth columns of A. 

A row by row examination of the numerator matrix re- 
veals that the sixth row is comprised of the five elements of 


the first row of LC , asingle element (aii — 6nfci— l\Ci) t and 
the last three elements of the first row of [A — BiK — LC ]. 
It will be seen that, since the first element of L is zero in the 
current parameter set, and because of the sparseness of A 
and B, the elements of the sixth row are all zeros and the 
numerator determinant vanishes. This indicates that, in 
the presence of a constant disturbance torque, the steady- 
state position estimate £2 is identically equal to zero. This 
in turn indicates that the final values of the position £2 and 
the position estimation error £ 2 are identical, and £ 2 can 
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thus be evaluated using Eq. (6). This approach avoids the 
complexity of evaluating the determinant of the 10 x 10 
matrices of Eq. (8). 

Thus, in the steady state, using Eq. (6), 


x = (A - LC)x + B 2 U 2 = 0 


and 


— det [Ai B 2 A3 A4 A5 


det A 


-LC 


-U 2 


Note that L and C are absent from the numerator because 
all elements of C, except for the second, are zero. Note also 
that the first row of A and the first element of C are zero, 
causing both determinants to vanish. However, since the 
plant integral state is uncoupled from both the plant and 
the estimator, the corresponding first rows and columns 
can be deleted from both the numerator and denominator 
with no loss in generality. It should be noted that if the 
first element of C is assigned a nonzero value, the denom- 
inator determinant remains finite as long as L is nonzero. 
The implications of this integral position feedback will be 
discussed later. 

Substituting the values of A, B 2y and C into the ex- 
pression for x 2y 


0 0 

1 /Jm 0 

—Dh/Ch — A r / C}{ J mZ\ K r (Z 2 — P2)/ Ch 

— 1/ JmZ 1 —P 2 

u 2 -l 2 10 0 

— I3 0 1 /Jm 0 

det 

—U —Vm/Ch — I< r /Ch —Bh/Cjj — Kfi/CuJ mZi Kr(Z 2 — P 2 )/Cr 
— / 5 —1 — P 2 

Expanding the determinants leads to the nonzero steady-state position estimate error x 2) and since the steady-state 
plant position x 2 equals x 2y 


0 1 

— 1 /Jm 0 

I<r/ChJmZ 1 -Vm/Ch - K r /Ch 

1/JmZi —1 


x 2 Dh 

U 2 “h Ar Z 2 /P 2 ) + lz(Du Jm + KrZ 2 /Z\P 2 ) -f I^Cr + ^5(^2/ P2 ~ 1 ) 


which, using current 70-m antenna servo parameter values, 
can be approximated by 

fi ~ 

U 2 Kr1 2 

This result invalidates the original premise that control 
feedback of estimated integral error is equivalent to feed- 
back of plant integral error in imparting Type II servo 
performance. That premise is valid only for inputs, such 
as the position command, which are coupled equally to the 
estimator and the plant. From Eqs. (3) and (4) and the 


block diagram of Fig. 4, it is seen that the plant is influ- 
enced by the control input U\ and a disturbance input U 2y 
while the estimator is influenced by the same control input 
and by the estimator feedback error LCx . In equilibrium, 
the plant disturbance U 2 is compensated by the control in- 
put U\\ thus, the identical U\ input to the estimator must 
be counteracted by the error LCx in order to obtain es- 
timator equilibrium. This implies that a finite estimator 
error x will always result from a plant disturbance input, 
while the individual components of x will be determined 
by the product of L and C . This line of reasoning explains 
the absence of the control gain K from the steady-state 
compliance expression. 
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In the present case, where there is no feedback of 
xi, the integral estimate error is allowed to grow un- 
bounded. The addition of a small amount of plant integral 
feedback to the estimator by assigning a small value to c\ 
would bound the error X\ , thereby forcing x 2y £2, and £2 
to zero. The required nonzero x would then result from 
nonzero values of the components other than x% and X3. 
This additional feedback imparts the desired disturbance 
accommodating control [3] properties to the system. 


VI. Precision-Control Mode 

In the precision (autocollimator feedback) mode, the 
control input U 2 is derived from the autocollimator, an 
electro-optical device that senses plant position error di- 
rectly. This error signal is filtered to remove noise and 
then integrated. The resulting plant integral and position 
errors are then combined with additional damping terms 
provided by the position estimate to form the plant input 
according to 


the steady-state estimator error x is therefore identical in 
both the computer mode and precision mode. However, 
the altered form of the control gains Kp and Kp in the 
precision mode change the coupling of the integral position 
x\ and its estimate x\. Accordingly, when solving Eq. (9) 
for final values, the row and column corresponding to the 
integral estimate x\ are deleted, and those corresponding 
to the integral x\ are retained. As a result, the final value 
solution of Eq. (9) yields the expected zero value for the 
position X 2 and a nonzero position estimate x*i- This result 
implies a nonzero estimator error x consistent with Eq. (6). 

These observations demonstrate an interesting duality 
between the two control modes, whereby the substitution 
of plant states for their corresponding estimator states in 
the control gain law results in the interchange of the fi- 
nal values of plant and estimated position variables. This 
tends to confirm the validity of the general expressions of 
Eqs. (7) and (9), and indicates that in the precision-control 
mode, a zero steady-state error always results from a dis- 
turbance torque input. 


JJ\ — — k\X\ — k 2 x 2 — £3X3 — &3X4 — k$x 5 

The resulting system equations therefore take a slightly 
different form from that of Eq. (7) due to the mixed plant 
and estimator feedback. Thus, for precision mode, 


x | | A — B\Kp -BiK e 

x I \LC-BiKp A -B l K E -LC 


x 

X 


VII. Compliance Transient Properties 

The shape of the transient response to disturbance 
torques can be inferred from the relative locations of the 
poles and zeros of the compliance transfer functions for 
the various system configurations. For the open-loop and 
hardware PID feedback systems, the poles and zeros are 
defined by Eqs. (2) and (3), respectively, where it is seen 
that the poles are the same as those associated with the 
respective command input-output transfer functions. 


+ 


B 2 — B\ KpD 2 

0 


U 2 


( 9 ) 


where Kp and Kp are the gain vectors associated with the 
plant and estimator states respectively. For the precision 
mode, 


For the state variable controller configurations, the 
poles are the respective eigenvalues of the square system 
matrices in Eqs. (7) and (9). The corresponding zeros are 
the complex frequencies of zero response of x, x, and Y for 
any disturbance input U 2 and initial condition. Thus, for 
the computer-mode configuration of Eq. (7), the response 
zeros are the roots of 


' K P 


k\ k 2 0 0 0 

Ke . 


0 0 h &5 


The B\KpD 2 term is included in Eq. (9) to maintain gen- 
erality, even though the KpD 2 product is always zero in 
the absence of plant acceleration feedback. 

From Eq. (9), it is seen that Eq. (6) for the estimator 
error is also applicable to the precision mode case, and 


(si — A) -BiK -B 2 ' 


X 


"0" 

LC (si — A + BJ< + LC ) 0 


X 

- 

0 

c 0 0 


u 2 _ 




To avoid the longhand expansion of the above determinant, 
the zeros were determined by a numerical evaluation of 
the transfer function zeros of the system of Eq. (7) with 
coefficient values representative of the 70-m azimuth servo. 
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The results coincide precisely with the two nonzero real 
roots of the determinant 

’ [si - A] B 2 
C 0 

at — Dh/Ch and — P 2 (the root at zero is absent), and with 
the five eigenvalues of [A — B X K — LC\ . A numerical com- 
putation of the eigenvalues of the square matrix of Eq. (7) 
shows that the accompanying poles are the five eigenvalues 
of [A — B\K ] and the five eigenvalues of [A — LC ]. 

For the precision-mode configuration of Eq. (9), the 
response zeros are the zeros of 

[si — A + B\Kp] —B\Ke B 2 

LC-BxKp [si- A + SiA^ + ZC] 0 

C 0 0 



X 


'o' 

X 

X 

= 

0 


u 2 


0 _ 


Numerical evaluation of the transfer function zeros indi- 
cates two real zeros at —Dh/Ch and — two zeros 
at the origin (one of which is canceled by a pole), and 
four additional real and complex zeros which are near, 
but noncoincident with, four of the poles of the square 
matrix of Eq. (9). This result indicates that unlike the 
computer-mode case of Eq. (7), the precision mode ex- 
hibits zero steady-state disturbance torque compliance. In 
addition, the low-frequency zero from the eigenvalues of 
[A — B\K — LC\ arising from Eq. (7) is absent in this 
case, resulting in a faster disturbance recovery transient 
in the precision mode. The system poles computed from 
Eq. (9) are identical with those from Eq. (7) for the com- 
puter mode. 

From the foregoing and from Eqs. (2) and (3), it is 
seen that the real zeros at — Dh/Ch and — P 2 appear in 
the compliance expressions for the open-loop and PID feed- 
back systems, as well as for the computer and precision- 
mode state variable controllers. The differing steady-state 
compliance properties of these four configurations arise 
from the presence of poles or zeros at the origin. This 
commonality implies that all four configurations exhibit 
identical disturbance transient characteristics for a short 
time interval following the transient. This in turn implies 


that the initial transient characteristics of the state vari- 
able controllers are governed solely by the plant parame- 
ters, and the departure from this initial characteristic is 
governed by the controller. 


VIII. Numerical Results 

Numerical evaluations of the differential equations in- 
corporated the physical parameter values of Table 1, the 
control coefficients of Table 2, and the disturbance torque 
parameters of Table 3. Physical parameter values of Ta- 
ble 1 were derived from component specifications accord- 
ing to [1] with the rate loop gain and network parameters 
based on [4]. The Kr value of 5.107 corresponds to a rate 
loop dc gain product of 40. The values shown for damp- 
ing Dh are calculated using Eq. (1) and correspond to the 
axis rate range of 1 to 40 times the sidereal rate with added 
leakage Qhl* Aerodynamic parameters used in determin- 
ing wind torques were extrapolated by McGinness 1 from 
earlier wind-tunnel tests performed on scale models of the 
64-m antenna. 


The position feedback control gain K and estimator 
gain L shown in Table 2 were calculated according to the 
optimal control criteria of Alvarez and Nickerson [5]. The 
baseline values L\ and K 4 in the table were calculated to 
duplicate the dominant closed-loop poles of [5] when ap- 
plied to the model of Fig. 3. Alternate parameter sets 
were calculated in a similar manner with weighting ad- 
justed to shift the dominant poles inward or outward from 
the origin. Those estimator gains that include integral 
terms were calculated using an alternate output vector 
H — [1 1 0 0 0], which included equal weighting of 

integral and position. 


The wind torque moments in Table 3 were calculated 
according to McGinness 2 using the torque relationship 


torque — 


CtI^D\ 

4 


(10a) 


where Ct is the aerodynamic lift coefficient, the dynamic 
wind pressure is 


Q = 



(10b) 


with air density, p = 0.00238 lb /ft 3 , and wind velocity, v. 


1 II. McGinness, “Effects of Wind Loading on 64- And 72- 
Meter Diameter Antenna” (JPL internal report), Reorder 
No. 84-2, May 1984. 

2 McGinness, ibid. 
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From Eqs. (10a) and (10b), the disturbance torque is 
seen to be a quadratic function of the wind velocity, so the 
disturbance imparted by a step change in velocity depends 
on the initial and final velocities rather than the step am- 
plitude. Thus, if is the mean of the initial and final 
velocities and vj is their difference, the torque disturbance 
is proportional to the product v m Vd . The worst-case dis- 
turbance then results when a large step-velocity change is 
superposed on a large mean velocity such that their prod- 
uct is a maximum. Assuming 30 mph as the maximum 
average wind and 12 mph as the maximum gust, the worst- 
case disturbance corresponds to a step change between 24 
and 36 mph. 

Breakaway friction torque levels were extracted from 
acceptance-test data recorded when the overseas antennas 
were built in 1972. The range of values shown is consistent 
with more recent informal reports of observations at the 
Goldstone site. 

The differential hydraulic pressures corresponding to 
the disturbance torques are included in Table 3. Because 
of its linear relationship to torque and its ease of direct 
measurement, the differential pressure has become a fa- 
miliar unit of torque measurement to those working with 
the antenna. 

The disturbance torque transient responses of the 
open-loop, computer-mode, and precision-mode configu- 
rations obtained from the time solutions of Eqs. (7) and 
(9) with I\ = K , i and L = L\ are shown in Fig. 5. The 
disturbance input corresponds to a 12-mph wind step com- 
bined with a 30-mph mean wind. The open-loop case ’was 
obtained from Eq. (7) with zero value of control gain K , 
The comparatively rapid recovery of the precision mode 
and the finite steady-state error of the computer mode are 
clearly visible in the time responses. The slower recov- 
ery of the computer mode is attributed to a low-frequency 
transfer function zero arising from Eq. (7), which is absent 
from the precision-mode case. 

Figure 6 shows the computer-mode disturbance torque 
transient response for various antenna rates. The re- 
sponses were generated from Eq. (7), where the 
term in the A matrix was adjusted according to Eq. (1) for 
each antenna rate. The effect of the changing Dh / Ch zero 
in the transfer function is evident from the change of the 
transient shape and of the final value. Figure 6 can also 
be used to judge the effects of increased hydraulic leakage 
by converting a known volumetric leakage to an equivalent 
antenna rate through the hydraulic displacement and gear 
ratio. The characteristic corresponding to the sum of the 


leakage equivalent rate and actual rate then represents the 
system behavior. 

Figure 7 shows the computer-mode transient response 
for the various values of control gain K listed in Table 2. As 
indicated earlier from the general properties of Eqs. (6) and 
(7), the gain K is seen to influence the speed of recovery, 
but has no effect on the final value. 

Figure 8 shows the computer-mode transient response 
for the various values of estimator gain L listed in Table 2. 
The effect of those L vectors having finite integral error 
correction terms l\ is seen in the faster recovery time, as 
well as reduction of the final value. The final value for 
those cases is still finite due to the absence of position 
integral feedback. 

Figure 9 shows the precision-mode transient response 
for the various values of control gain I\ listed in Table 2. 
Increasing values of K are seen to decrease the peak error 
and speed the recovery transient. 

Figure 10 shows the precision-mode transient response 
for the various values of estimator gain L listed in Table 2, 
As expected, the estimator gain L has little effect on the 
compliance transient when the response of the estimator 
is faster than that of the control gain. 

IX. Summary and Conclusions 

The transient disturbance-torque rejection properties 
of the 70-m axis servos are shown to be governed by phys- 
ical hardware parameters, as well as by the properties of 
the software-control algorithm. In particular, the effective 
hydraulic damping Dfj/Cjj , which is strongly affected by 
hydraulic leakage as well as by axis rate, determines the 
peak transient error, and in the case of the computer mode 
it also produces a finite steady-state error. The peak and 
steady-state disturbance-torque errors are essentially un- 
affected by the control gain K } but can be reduced by 
increased estimator gain L at the expense of noise rejec- 
tion. 

The precision-control mode, by virtue of the direct 
feedback of hardware (as opposed to estimated) position 
and integral errors, is shown to have disturbance torque re- 
jection properties superior to those of the computer mode. 
This deficiency of the computer mode results from the 
omission of the position integral error from the estimator 
equations, and can be corrected by modifying the equa- 
tions to include the integral. Significant improvement can 
also be obtained by modifying the estimator gain parame- 
ter L to include a coefficient corresponding to the integral 
position estimate. 
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Table 1. The 70-m antenna axis servo parameter values 


Symbol 

Parameter 

Units 

Value 

Jma 

Inertia moment, azimuth 

in.-lb-sec 

12.0 

Jme 

Inertia moment, elevation 

in.-lb-sec 

8.0 

V h 

Hydraulic displacement 

in. 3 /radian 

1.53 

Ch 

Hydraulic compressibility 

in. 3 /psi 

0.00314 

Dh 

Hydraulic damping 

in. 3 /psi-sec 

0.00188 to 0.0754 

Qhl 

Hydraulic leakage 

in. 3 /sec 

1.67 

N 

Reduction gear ratio 

dimensionless 

28730 

Pi 

Tach network negative pole frequency 

sec -1 

80 

P2 

Lag network negative pole frequency 

sec -1 

0.24 

Zi 

Tach network negative zero frequency 

sec” 1 

5.0 

z 2 

Lag network negative zero frequency 

sec -1 

4.4 

Kr 

Rate loop gain constant 

in. 6 

5.11 

K v 

Valve flow constant 

in. 3 /mA-sec/psi 0,6 

0.731 

Pv 

Valve pressure drop 

psi 

2750 

Cta 

Aerodynamic lift coefficient, azimuth 

dimensionless 

0.1217 

Cte 

Aerodynamic lift coefficient, elevation 

dimensionless 

0.1231 

D a 

Reflector aerodynamic diameter 

m 

70 


Table 2. System, control gain, and estimator parameters 


(a) System matrices 

"o 1.0000 

0 



0 

0 



0 0 

1.0000 

0 

0 


A = 

0 0 

0 



8.3333 

0 



0 0 

-23.7194 

-27.7072 

13.2484 



0 0 

- 5.1070 

-8.5117 

-0.2400 



0 

0 






0 

0 





B = 

0 

0.0833 






16.2643 

-0.2711 






5.1070 

-0.0851 





C ” | 

^0 1.0000 

0 0 0 




| 

D = ; 

H 







(b) Control gains K 
Control gain K 

Numerical value 

Control poles from eigenvalue (A — BA ) 


0.5200 

-12.2372-f 12.5297t 


1.3417 

-12.2372 - 12.5297t 


0.5712 

-2.5455 


0 

— 0.4636 + 0.4268* 


0 

-0.4636 - 0.4268* 

k 2 

0.3162 

-12.8664 4* 9.5595* 


0.9097 

-12.8664- 9.5595* 


0.1685 

-2.4141 


0.0162 

-0.3937 4- 0.3861* 


0.1416 

-0.3937- 0.3861* 

k 3 

0.4472 

— 12.9520 4* 9.4886* 


1.1113 

-12.9520- 9.4886* 


0.2087 

-2.4404 


0.0279 

-0.4668 4-0.4539* 


0.1716 

-0.4668 - 0.4539* 

K a 

0.6325 

-13.1214 + 9.3446*' 


1.3684 

-13.1214 - 9.3446* 


0.2642 

-2.4911 


0.0500 

-0.5509 + 0.5291* 


0.2106 

-0.5509- 0.5291* 

k 5 

0.8944 

-13.4526+ 9.0474* 


1.7032 

-13.4526- 9.0474* 


0.3450 

-0.6445 + 0.6078* 


0.0916 

-0.6445 - 0.6078* 


0.2629 

-2.5861 

K 6 

1.2649 

-14.0877+ 8.4129*' 


2.1515 

-14.0877- 8.4129* 


0.4690 

-0.7426+ 0.6822* 


0.1699 

-0.7426- 0.6822* 


0 t 3339 

-2.7556 
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Table 2. System, control gain, and estimator parameters (contd) 


(c) Estimator gains 


Estimator gain L 


Numerical value 


Estimator poles from Compliance zeros from 

eigenvalue (A — BK+ — LC) eigenvalue (A — BK 4 — LC) 


L 1 


L 2 


l 3 


U 


0 

-12.7800 + 9.62981 

-13.1060 + 

9.3382t' 

0.4546 

-12.7800 - 9.6298i 

-13.1060 - 

9.3382t 

0.0033 

-2.3859 

-2.1842 


—0.0020 

-0.4559 

-1.4267 


0.0026 

0 

-0.4674 


0.8638 

-12.4988 + 9.7652; 

-12.6999 + 

9.4375; 

3.1261 

-12.4988- 9. 7652t 

-12.6999- 

9.4375; 

4.8336 

-4.8845 

-5.0267 


-7.6308 

-1.1912 

-2.5190 


-9.1392 

0 

-0.0162 


0.8075 

-12.7656 + 9.63451 

-13.0283 + 

9.3205; 

1.4625 

-12.7656 - 9.6345; 

-13.0283- 

9.3205; 

1.1140 

-1.9392 + 0.7044; 

-2.6041 + 

1.3095* 

-0.6472 

-1.9392 - 0.7044; 

-2.6041 - 

1.3095*’ 

0.5716 

0 

-0.0333 


0.6204 

-12.7686 + 9.6345; 

-13.0829 + 

9.3379; 

0.5545 

-12.7686- 9.6345; 

-13.0829 - 

9.3379; 

0.1357 

-2.5083 

-2.0488 + 

0.4207; 

-0.3804 

-0.4562 

-2.0488- 

0.4207; 

-0.5582 

0 

-0.1267 



Table 3. Typical disturbance torque levels 

Azimuth Elevation 

Wind moments for 30-mph wind 2.67 x 10 6 ft-lbs 2.70 x 10 6 ft-lbs 

729 psi 738 psi 

1.03 - 1.28 x 10 6 
280 — 350 psi 


Breakaway friction torque 


0.84 - 1.46 x 10® ft-lbs 
230 — 400 psi 




CT COUNTERTORQUE 
MOTOR 


SYSTEM 

PRESSURE 


Fig. 1. The 70-m antenna axis servo countertorque system. 



Fig. 2. Servo hydromechanical system linearized model. 
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Fig. 4. Servo position controller block diagram. 



TIME, sec 


Fig. 5. Disturbance torque transient response of open-loop, computer-mode, and 
precision-mode configurations. 








AZIMUTH ERROR, mdeg AZIMUTH ERROR, mdeg 


// 





AZIMUTH ERROR, mdeg AZIMUTH ERROR, mdeg 



Fig. 8. Computer-mode disturbance torque transient response for various L 



TIME, sec 

Fig. 9. Precision-mode disturbance torque transient response for various K 
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The Radio Science System installed at Parkes, Australia for the Voyager Nep- 
tune encounter was specified to meet the same stringent requirements that were 
imposed upon the DSN Radio Science System . This article describes the system 
design and test methodology employed to meet these requirements at Parses, and 
presents data showing the measured performance of the system. The results in- 
dicate that the system operates with a comfortable margin on the requirements. 
There was a minor problem with frequency-dependent spurious signals which could 
not be fixed before the encounter. Test results characterizing these spurious signals 
are included. 


I. Introduction 

In conjunction with efforts to upgrade the DSN in sup- 
port of the Voyager Radio Science experiments at Neptune, 
a Radio Science receiving system was temporarily installed 
at the Commonwealth Scientific Industrial Research Orga- 
nization (CSIRO) Radio Telescope located in Parkes, New 
South Wales. The primary motivation for the Parkes in- 
stallation was to allow improved signal reception at X-band 
through the application of signal arraying techniques be- 
tween the Parkes 64-m antenna and the Canberra Deep 
Space Communications Complex (CDSCC) 70-m antenna 
located some 300 km away. 

This article presents an overview of the Radio Science 
System at Parkes. Section II covers the performance re- 
quirements imposed by the Voyager Neptune encounter, 
and is followed in Section III by a description of the design 


chosen to meet these requirements. The test methodology 
for verifying system performance is discussed in Section IV, 
with the results from tests performed in March 1989 pre- 
sented in Section V. 

The results in Section V show a system that is meet- 
ing its performance requirements to an outstanding degree. 
Both the long- and short-term frequency stability measure- 
ments show performance margins of about an order of mag- 
nitude better than the requirements. The only exceptions 
are the frequency-dependent spurious signals generated by 
an internal frequency synthesizer which is used to tune the 
carrier into a narrow bandwidth. This represents a subsys- 
tem problem that was identified at an early stage, but was 
not deemed significant enough to warrant a major subsys- 
tem change prior to encounter. In order to understand and 
characterize these spurious signals, however, a calibration 
test was run, and the results are also included in Section V. 
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Although this article presents a complete overview of 
the Parkes system, it supplements a previous article doc- 
umenting the associated Radio Science Systems installed 
at CDSCC and at the Usuda, Japan station [1]. Inter- 
ested readers will also find in this reference a description 
of the Radio Science experiments, an elaboration on the 
phenomenon of phase noise and how it degrades the per- 
formance of the Radio Science System, and a fuller account 
of the system test hardware and methodology. 


II. System Performance Requirements 

Spacecraft Radio Science is concerned primarily with 
two types of measurements: (1) occultation measurements 
in which the spacecraft signal passes through a planetary 
atmosphere, ionosphere, or ring system as it propagates 
to Earth, and (2) gravitation measurements in which the 
planet’s gravity field perturbs the spacecraft trajectory and 
hence the downlink Doppler frequency [2]. Radio Science 
as performed at Parkes, however, is concerned only with 
the occultation measurements. 

For these measurements, there are certain aspects of 
the spacecraft carrier, namely the fluctuations in its ampli- 
tude and phase, from which the properties of the planetary 
medium can be determined. The Radio Science System 
is required to record the spacecraft carrier with minimal 
degradation to these parameters. However, there are many 
noise processes in the system that may contribute toward 
their degradation. These include thermal noise, short-term 
phase noise, receiver-generated spurious signals, long-term 
phase drifts due to diurnal temperature changes, and gain 
instabilities. 

Of these noise sources, the most critical are those re- 
lating to phase stability, both long- and short-term, in- 
cluding spurious signals. Although amplitude, or gain, in- 
stabilities are important, they typically exist at levels well 
below comparable levels of the phase instabilities. This 
is because any phase-noise components and line spectra 
spurious signals that enter the first local oscillator (LO) 
chain at a low frequency level, say 5 MIIz, are severely 
exaggerated by the frequency multiplication performed to 
generate the X-band LO. This property does not apply to 
amplitude noise. 

Long-term phase fluctuations on the order of 1000 sec 
in the receiver LOs can prevent the correlation of data 
between two receiving stations, which in turn reduces the 
effective signal- to-noise ratio (SNR) of the combined data. 
Short-term phase-noise processes generated in the receiver, 
on the other hand, corrupt the power spectrum of the re- 


ceived signal more directly by masking the phase fluctua- 
tions placed upon the carrier by the planetary media. Spu- 
rious signals generated within the receiver also corrupt the 
received signal spectrum in such a way as to interfere with 
the filtering that is performed when the data are processed. 

Since the phase stability of the Radio Science System 
is important over such a wide range of time scales (from 
1000 sec to 0.1 msec), the requirement for it is divided into 
two requirements over two different time scales. The long- 
term phase stability is specified from 1 to 1000 sec using the 
Allan variance parameter, which is a universally accepted 
measure of fractional frequency deviation [3, 4]. The Voy- 
ager Neptune requirement for the system Allan deviation 
is 6 x 10“ 13 for 1-sec integration times and 3 x 10“ 13 for 10- 
to 1000-sec integration times. The short-term phase sta- 
bility is specified using £(/ m ), the single-sideband spectral 
density of the phase noise at a carrier offset frequency of 
/ m , where f m ranges from 1 Hz to 10 kHz. The required 
level of the single-sideband phase noise is -53 dBc/Hz at 
an f m of 1 Hz, and —60 dBc/Hz from 10 Hz to 10 kHz. 

The spurious signals mentioned previously refer to 
discrete sinusoidal components within the sideband spec- 
trum which are generated when line currents from AC 
power supplies or other electromagnetic interference (EMI) 
sources modulate the LO signals. The Voyager Neptune 
requirement for the system spurious signals is the same 
level as the phase-noise requirement using a 1-IIz mea- 
surement bandwidth (although the spurious signals main- 
tain a level independent of the measurement bandwidth): 
—53 dBc single-sideband at 1 Hz offset from the carrier, 
and —60 dBc from 10 Hz to 10 kHz. 


III. System Configuration 

Figure 1 shows a block diagram of the Radio Science 
System at Parkes. The antenna is a 64-m dish which gath- 
ers and focuses incoming signal flux density to a feed in the 
aerial cabin, a small room situated at the apex of the tri- 
pod structure built upon the dish. Inside the aerial cabin 
are two traveling- wave masers (TWMs) which can be con- 
figured to look at the sky or an ambient load termination. 
The noise-adding radiometer (N AR) injects its noise diodes 
immediately prior to the TWMs. Also in the aerial cabin 
are two RF-IF converters which convert the X-band re- 
ceived signal of 8415 MHz to a 315-MHz IF. Each TWM 
has its own dedicated RF-IF converter. Taken together, 
the two TWM and RF-IF channels constitute a primary 
and backup subsystem. Following conversion, the two IF 
outputs are sent down a cable run to the Parkes Canberra 
Telemetry Array (PCTA) van. Inside the van is a switch- 
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ing assembly that selects the prime or backup IF channel 
and sends it to both the telemetry receiver and the Radio 
Science Data Acquisition Subsystem (DAS). 

The DAS contains dual parallel IF-VF converter chan- 
nels and recording assemblies. Two channels are needed to 
prevent loss of data when the spacecraft switches between 
the coherent mode of transmission in which the downlink 
is phase-locked to the uplink, and the noncoherent mode 
in which the downlink is derived from the onboard Ultra- 
Stable Oscillator (USO). This is because a single-channel 
DAS is not able to continuously track the 5.4-MHz shift 
in received X-band carrier frequency that results from the 
mode change; it would have to be halted and restarted 
with a different frequency-tuning predict set. Thus with 
a dual-channel DAS, each of the two channels is assigned 
one of the modes and is preprogrammed to track the time- 
frequency profile of this mode. The two channels are run 
in parallel so that when a mode change occurs, the signal 
disappears from one channel and appears immediately on 
the other channel. 

The basic channel in the DAS consists of a Radio Sci- 
ence IF-VF Converter (RIV) followed by a Mark III Occul- 
tation Data Assembly (ODA) that digitizes the video sig- 
nal, tape records the signal, and controls the LO frequency 
in the RIV. The ODA is made up of a Mark III Narrow- 
Band Occultation Converter (NBOC), a Modcomp II com- 
puter with an IBM PS/2 computer to provide terminal in- 
put, and two Wanco tape drives for storing digitized data. 
For frequency tuning purposes, the Modcomp II computer 
accepts and stores predict sets of up to 42 points. The pre- 
dict sets are entered through the PS/2 terminal. The local 
oscillator used in the RIV is tunable by the ODA computer 
so that the received spacecraft carrier containing Doppler 
shift can be kept within the 35-kIIz VF-filter bandwidth. 
The LO mixer injection frequency is derived by frequency 
multiplying the output of the computer-controlled Dana 
synthesizer by a factor of seven in order to achieve the de- 
sired LO frequency range. The tuning capability is pro- 
vided by the Programmed Oscillator Control Assembly 
(POCA), which takes frequency and ramp-rate commands 
from the Modcomp II computer and controls and monitors 
the Dana synthesizer accordingly. 

The Parkes IF-VF converter is different from the type 
used in the DSCC Radio Science System in terms of how 
the bandwidth limiting is accomplished and how the adja- 
cent sideband noise spectrum is rejected from folding over 
into the passband. The DSCC design uses a quartz-crystal 
bandpass filter with a center frequency that is offset from 
the final LO, while the Parkes configuration uses a single- 
sideband (SSB) mixer for converting the IF directly to VF. 


The SSB mixer passes the upper sideband, which contains 
the carrier, and rejects the lower sideband, which contains 
only noise. The LO for the SSB mixer is offset below the 
carrier by one-half of the desired video bandwidth. A VF 
lowpass filter (LPF) following the SSB mixer band lim- 
its its output so that the combination forms an equivalent 
35-kHz bandpass-type filter, which acts as an anti-aliasing 
filter for the digital sampling process. A VF amplifier fol- 
lows the LPF to drive the digitizer in the NBOC. 

The video signal is monitored on a spectrum analyzer 
at a point just prior to its digitization. Also, an RMS volt- 
meter measures the level of the input signal to the analog- 
to-digital converters (ADCs) in the NBOC. During system 
precalibrations, the RIV attenuator is adjusted to achieve 
a video level, which is largely thermal noise power, of about 
1 volt RMS in order to avoid saturating the ADCs. 

After sampling the signal, the NBOC sends the digi- 
tized data to the Modcomp II for magnetic tape recording. 
The Modcomp II reads each byte immediately after it has 
been written to the tape and sends the data back to the 
NBOC. The NBOC then converts the digital samples back 
to an analog signal that can be monitored with a spectrum 
analyzer to verify that the desired data are being recorded 
to tape. 

The Frequency and Timing Subsystem (FTS) at 
Parkes uses two hydrogen-maser frequency standards lo- 
cated in a temperature-stabilized room that is remote from 
the PCTA van for magnetic isolation purposes. The fre- 
quency standards provide 5-MIIz references to the PCTA 
van, each sent through a quartz-crystal oscillator clean-up 
loop (CUL) to improve the phase noise at low signal offset 
frequencies near 1 Hz. The output of the prime CUL is 
distributed throughout the van using two IIP 5087 distri- 
bution amplifiers. The IIP5087 amplifiers supply reference 
signals to all of the time-code generators (TCGs) in the 
van, which in turn provide timing signals to the ODA and 
telemetry computers. 

The redundant receiver first local oscillators are de- 
rived from the 5-MHz CUL output. First, a x20 multi- 
plier in the PCTA van multiplies the CUL output up to 
100 MHz. Two 100-MHz outputs are sent via hardline 
coaxial cables to the aerial cabin. There the lOO-MIIz sig- 
nals are further multiplied by a factor of 81 to provide the 
8.1-GHz first LOs to the RF-IF downconverters. 


IV. System Test Methodology 

The system testing at Parkes was aided by the test 
methodology and hardware that had been developed for 
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the CDSCC, Both of the instruments developed for Radio 
Science System stability testing — the test transmitter and 
the digital stability analyzer (DSA) — were designed to be 
transportable between the Canberra and Parkes facilities. 

The block diagram of the Radio Science System at 
Parkes in Fig. 1 shows this test instrumentation. The 
methodology for Radio Science System stability testing 
involves injecting a highly stable RF test signal into the 
TWM input and measuring the stability of the test sig- 
nal after it has been downconverted to video-frequency, 
digitized by the NBOC, and then reconstructed back to 
an analog waveform. This procedure is a direct means of 
measuring the amount of degradation the entire system 
imposes upon the received signal. 

The RF test signal is generated by the portable 
test transmitter. The transmitter contains a phase-stable 
quartz-crystal oscillator for performing phase-noise mea- 
surements, and is phase- locked to an FTS 5-MHz refer- 
ence for performing Allan variance measurements. To reli- 
ably measure the Radio Science System’s performance, the 
transmitter was designed with performance requirements 
about an order of magnitude better than the Radio Science 
System requirements. 

There are two places where the RF test signal can be 
injected into the system. The test transmitter is equipped 
with an RF horn so that the signal can be radiated into 
the antenna feed from a strategic point on the ground. 
However, the proximity to the antenna of most convenient 
locations on the ground tends to give rise to multipath 
effects which distort the test results. For all of the sys- 
tem tests described in Section V, the multipath problem is 
avoided by injecting the test signal into the system through 
a coupler at the TWM input. 

In order to improve the system SNR, a 30-dB attenu- 
ator is inserted into the signal path immediately following 
the TWM. This allows injecting a 30-dB stronger signal 
into the TWM without saturating the following receiver 
subsystem. However, the resulting increase in SNR is only 
about 19 dB because the thermal noise of the receiving 
subsystem now becomes the dominant component of the 
system noise temperature. 

For signal analysis, the digital stability analyzer ac- 
cepts the system’s video- frequency output either prior to 
or following digitization and processes it for phase noise 
and Allan variance information. The DSA is essentially a 
computer containing its own ADC for signal acquisition, as 
well as digital signal processing hardware and software to 


measure the power spectrum of a video test signal and de- 
compose it into the quadrature component amplitude and 
phase spectra. The system phase-noise response can be 
read directly from the phase spectrum. For Allan variance 
tests, the DSA contains hardware which downconverts the 
video signal to 1 Hz. A time-interval counter samples the 
zero crossing of this 1-Hz signal and sends it to the DSA’s 
computer for processing [1, 3, 4], 

During the phase-noise test, the test transmitter is 
ideally configured to run on its internal oscillator so that 
there is no coherence between the system’s reference fre- 
quency and the test signal. With this configuration the 
entire system performance is measured, including that of 
the hydrogen-maser frequency standard. On the other 
hand, for Allan variance testing the test transmitter must 
be phase-locked to the station FTS because the long-term 
stability of its quartz-crystal oscillator is inadequate for 
measuring system performance over long test periods. Be- 
cause the transmitter is phase- locked to the station FTS, 
the FTS effects cancel out. Therefore, the stability of the 
FTS must be measured separately with the DSA by com- 
paring the two similar hydrogen- type masers. 

V. System Test Results 

The testing of the Radio Science receiver at Parkes 
helped to uncover and remove many problems that were 
not apparent at the subsystem level. However, the re- 
sults shown here include only typical performance curves 
taken after most of the problems had been removed. In 
addition, measurements taken with the NAR operating 
in encounter configuration are also presented. The last 
section summarizes a special calibration test made on the 
Dana LO synthesizers in the IF-VF converters. This test 
was performed to help characterize the behavior of certain 
frequency-dependent spurious signals that are generated 
by these synthesizers. 

A. Allan Variance Test Results 

Figure 2 shows the Allan variance results of the sys^ 
tern measured with the DSA. These data were taken using 
the test transmitter, the prime 5-MHz CUL, the hydrogen 
maser, and the backup (channel 1) x81 multiplier feed- 
ing into the prime (channel 2) RF-IF downconverter. The 
unusual x81 configuration was implemented to bypass a 
malfunctioning unit. These results represent the system 
at its best performance. The Allan deviation at 1 sec is 
almost an order of magnitude below the 6. OEM 3 specifi- 
cation. The only concern is the peak at about 1000 sec 
which represents a long-term drift in the system phase. 
The source of the drift was not determined with confi- 
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dence, so it could be due to the test transmitter, system 
cable temperature effects, or the LO multiplier chain. In 
any event, the peak is still an order of magnitude better 
than the system specification. 

B. Phase Noise and Power Spectrum Tests 

Figure 3 shows the system phase-noise spectral density 
measured with the DSA. The hydrogen-maser frequency 
standard was used. The system configuration was not opti- 
mal since a x81 frequency-multiplier unit that later proved 
to be defective was used, although it was not malfunction- 
ing during this period. The frequency span of the plots is 
200 Hz around the carrier and shows detail on a number 
of spurious signals. Of particular interest is a set at 1 Hz, 
approximately —59 dBc below the carrier. The source of 
these spurs was not determined; how T ever, they were not 
present during the later testing period and do not show up 
in the final data (for example, see Fig. 4). 

Note that the level of the discrete sinewave compo- 
nents in Figs. 3 through 7 is adjusted to correct for the 
resolution bandwidth of the spectrum measurement. A 
spectrum analyzer typically measures the level of the dis- 
crete components of a signal correctly, but the continu- 
ous, broadband components need to be scaled by one over 
the resolution bandwidth of the measurement to indicate 
spectral density in units of dBc/Hz. For Figs. 3 through 
7, the normalization has already been applied to the ver- 
tical axis so that the broadband noise components can be 
read directly off the plots in units of dBc/IIz. However, 
the normalization of the vertical axis makes the discrete 
component levels incorrect, so an adjustment is made to 
indicate their correct levels in units of dBc. Stated simply, 
the spectrum plots are corrected to show what the spec- 
trum would look like if it was measured with a resolution 
bandwidth of exactly 1 Hz, and both the discrete (after 
correction) and continuous components are read off these 
plots using the same vertical scale. 

For Figs. 8 through 11, only the discrete components 
are of interest, so the vertical axis is not adjusted. The dis- 
crete components are read off as they are plotted in units 
of dBc without any need for adjustment. The broadband 
components are plotted in units of dBc per the resolution 
bandwidth of the measurement (0.38 Hz, 0.38 Hz, 2.4 Hz, 
and 2.4 Hz respectively). 

The plot in Fig. 3 shows that neither the discrete noise 
components nor the broadband noise components violate 
the phase-noise specification. However, it should be noted 
that the test transmitter in this case is phase-locked to the 


station FTS 5-MHz reference, so that the effects of FTS 
have cancelled out. 

In order to measure the performance of the entire Ra- 
dio Science System, including the FTS standard, the above 
test was repeated ten days later with the system again run- 
ning on the hydrogen- maser frequency standard but with 
the test transmitter free running on its internal quartz- 
crystal oscillator. The uncertainty in the output signal 
frequency from the test transmitter made the use of the 
digital stability analyzer impractical for making system 
measurements, so the HP3561 spectrum analyzer was used 
to measure the power spectrum. Since the power spectrum 
is the root-sum-square of the phase and amplitude spectra, 
a power spectrum which meets the phase noise require- 
ment implies a phase-noise spectrum which also meets the 
requirement. 

Figures 4, 5, 6, and 7 show the single-sideband power 
spectrum around the carrier over four different frequency 
spans: 20 Hz, 400 Hz, 2 kHz, and 20 kHz. The close-in 
single-sideband noise power at 1 Hz is seen in Fig. 4 to 
be —57.3 dBc/IIz. This compares well with previous mea- 
surements of the test transmitter which showed a single- 
sideband phase noise at X-band of —58 dBc/IIz. The level 
at 10 Hz is less than —65 dBc/Hz. From 100 Hz to 10 kHz, 
the noise is less than -75 dBc/Hz. The power line spurs 
are less than -60 dBc, while the Dana spurs at 2.85 kHz 
are —52 dBc. 

Figure 5 shows two sets of spurious signals that are 
generated by the Modcomp II computers in the ODAs. For 
comparison, Fig. 6 shows the power spectrum taken with 
the Modcomp IIs turned off. One set of spurs shows up 
(in Fig. 5) at about 109 Hz and is generated by the prime 
ODA. The other set of spurs at about 57 Hz is generated 
by the backup ODA. Over time both spurs were seen to 
drift randomly in frequency by several hertz or so. 

The source of these spurs was unknown until the 
ODAs were switched off. Further investigation by the per- 
sonnel at Parkes revealed that the spurs were generated by 
clock signals internal to the ODAs, which were not phase- 
locked to the FTS and hence slightly off-frequency. The 
clock signals were phase-modulating the FTS 5-MHz ref- 
erence at the beat note frequency between the sixth har- 
monics of the two timing signals. The interaction was 
made possible through a faulty hardline connector that 
coupled the ODA clock radiation into the FTS 5-MHz sig- 
nal. Even though the resulting spurious phase modulations 
were quite low on the 5-MHz line, they were exaggerated to 
a level of —45 dBc by the frequency multiplication factor 
needed to generate the 8.1-GHz first LO. The replacement 
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of the faulty connector reduced the power of the spurious 
signals to an acceptable level. 

Figure 8 shows the close-in power spectrum of the 
20-kHz VF carrier with 3.5-Hz and 10.5-IIz offset spurs 
present. This test was run with the NAR injecting a 50-K 
noise diode modulated at 20 IIz into the signal path before 
the low-noise amplifier (LNA) maser. Since the spurs are 
only 48 dB below the carrier, and there was no time to 
investigate and reduce the interaction, it was recommended 
that the 50-K noise diode not be used during the prime 
Radio Science data-gathering time periods. Fortunately, 
the 1-K noise diode, which is currently planned for use 
during the encounter, does not contribute any measurable 
effect above —65 dBc, as shown in Fig. 9. Note that the 
test configuration was run without the 30-dB attenuator 
after the maser that is normally installed during system 
testing to improve the test signal’s SNR, so the noise floor 
with this 100-Hz span is only —67 dBc. This was necessary 
because the 30-dB attenuator in the front end changes the 
system noise floor and renders the NAR inoperable. 

C. Dana Calibration Tapes 

A series of calibration tapes was run at Parkes to char- 
acterize the spurious signals generated by the Dana synthe- 
sizers, which are used to provide the tuned local oscillator 
in the IF-VF converter. During the early phase of subsys- 
tem testing at JPL, these synthesizers had exhibited two 
types of spurious signals which violated specifications. The 
first type was induced by power supplies in the Dana chas- 
sis and nearby instruments which modulated the Dana’s 
crystal oscillator at the power-line frequency via magnetic 
field coupling. These spurs were reduced by shielding the 
synthesizers from magnetic flux and moving their internal 
power supplies into a separate chassis. The second type 
are frequency-dependent modulations on the synthesizer 
output that are generated by the internal digital circuitry 
of the synthesizer. These spurs have been measured on the 
video carrier as high as about —52 dBc in some cases and 
may actually shift across the carrier signal as the Dana 
frequency changes. Unfortunately, they could not be re- 
moved from the device without a fundamental change in 
its design. 

In order to characterize the expected behavior of these 
spurs during the encounter, predict sets were generated 
that stepped the POCA-controlled Dana across the im- 
portant parts of the encounter frequency range. Energy 
level is examined to characterize the effect of a spur close 
to the carrier when the Dana frequency is changed. A total 
of three predict sets of 21 Trequency steps each was con- 
structed. One set was made to cover the one-way mode 


prior to the Neptune occultation, a second set was made 
for the two-way mode prior to occultation, and a third set 
was made to cover the one-way mode after the occulta- 
tion. The frequency range of the predict sets is defined 
in Table 1. The actual frequency of each point can be 
determined by the equation: 

_ zYFstop — Fstart) _ 

F(?) = Fstart -f — for i — 0 to 20 

where Fstart and Fstop are taken from Table 1. 

The tapes were made by simulating the spacecraft sig- 
nal with an HP8662 synthesizer injected into the IF input 
at the RIV. During recording, the IIP8662 was stepped in 
frequency with an HP85 computer so that the resultant 
video remained at 20 kHz for each step. The duration of 
each step was about 18 sec, with 1 sec allowed for frequency 
changes between steps. 

Both of the Danas were calibrated with the three pre- 
dict sets for a total of six tapes. The tapes were analyzed 
on the JPL Radio Occultation Data Analysis (RODAN) 
computer. Some of the results are presented below. 

In general, the Dana with NASA serial #106227 is su- 
perior to Dana #118430 because it has much fewer spurs. 
Figures 10 and 11 compare #106227 with #118430 at a 
Dana frequency of 45578013.78 Hz taken from the one- 
way pre-occultation predict set. The pair of spurs at about 
1.5 kHz from the carrier is at —57 dBc. When the Dana 
frequency is in this neighborhood, the 1.5-kHz spurs drift 
at a rate equal to ten times the change in Dana frequency, 
so it is natural to assume that they will cross the carrier 
at about the same level. In other words, if the Dana fre- 
quency increases by 100 Hz, the spurs will move closer to 
the carrier by 1 kHz, or will be about 500 Hz away. 

Not all of the spurs obey this 10-to-l dependency upon 
the Dana frequency. Other plots not shown here reveal a 
1-to-l relationship, and in one case a relationship much 
higher than 10 to 1 was observed. 

In order to calculate the real effect of the Dana spurs 
upon the system, one must consider the rate at which they 
will move during encounter. Since a moving spur spreads 
its power across a finite frequency range, its level at one 
discrete frequency point in a measured power spectrum 
will be much lower than that of a stationary spur with the 
same power level. How much lower depends upon the rate 
at which the spur moves. 
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For example, the spectra in Figs. 10 and 11 are 
obtained by averaging successive fast-Fourier transforms 
(FFTs) of the sampled video signal. The data were sam- 
pled at 80 kilosamples/sec. A total of 32768 data points 
is used for each FFT. This gives an FFT bin spacing of 
approximately 2.44 Hz, with a data acquisition time of 
0.4096 sec for one buffer of data. If instead of being a 
fixed frequency, the LO had tracked the carrier at a rate 
of about 30 Hz/sec due to Doppler (about the highest rate 
predicted for the Neptune encounter), the carrier would 
remain steady at the video frequency, but the spurs would 
change frequency. During the time it took to acquire the 
data for one FFT, the LO would move by 12.288 Hz. The 
Dana frequency itself would only change by 1.75 Hz be- 
cause of the x7 frequency multiplier placed after the Dana. 
If the spurs were moving at a rate ten times the Dana 
frequency change, they would have shifted by 17.5 Hz or 
about seven FFT bins. Assuming a uniform ramp rate, the 
signal power would be divided evenly among seven FFT 
bins instead of one, with each bin showing only about one- 
seventh the original power of the spur. The peak power in 
this smeared spur would then be 8.5 dB lower than if the 
spur had not been smeared. If, however, the spur obeyed 
a 1-to-l dependency upon the Dana frequency, it would 
move only 1.75 Hz and would likely spend most of the time 
within a single FFT bin, so no reduction in level would be 
observed. 


VI. Conclusions 

This article has outlined the installation of the Ra- 
dio Science System at the Parkes Radio Telescope in New 
South Wales, Australia. The Voyager Neptune encounter 
performance requirements and system design were dis- 
cussed, as well as the techniques used to measure the per- 
formance of the system and the actual data taken on site 


during performance verification. The results shown in Sec- 
tion V indicate that in general the system is performing 
with a comfortable margin on the desired characteristics. 
The problem areas remaining in the system are listed be- 
low. 

In the area of Allan variance performance, the system 
is operating at about an order of magnitude better than 
specification. The only concern lies in a long-term phase 
drift that makes the Allan deviation rise up to about IE-14 
at 1000 sec. This is not in violation of specification, but 
could be a problem if it gets any worse. 

In the area of phase-noise performance, the system 
is also operating at a level better than specification. An 
initial problem with the ODA computers generating un- 
wanted modulations on the first LO was corrected with 
the replacement of a faulty hardline connector. The only 
remaining violations are the discrete frequency compo- 
nents generated by the Dana synthesizers described in Sec- 
tion V, which were considered too minor to warrant a ma- 
jor subsystem change to remove them. 

Finally, a problem is noted concerning which noise 
diode is used in the NAR. A 50-K noise diode generates 
interfering spurious signals that show up at 3.5 and 10.5 Hz 
on the carrier at video frequencies, at about —48 dB be- 
low the carrier. However, no spurs above —65 dBc are 
measured when the 1-K noise diode is used. 

The system is considered adequate and ready to re- 
ceive and record the Voyager spacecraft signal during the 
upcoming Neptune encounter. It meets the basic system 
requirements and contains the additional functional ca- 
pability to receive the downlink signal during the abrupt 
frequency shifts associated with spacecraft mode changes 
without loss of any data. 
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Table 1. Predict sets used for Dana calibration 


Set type 

X-band range (GHz) 

POCA range (MHz) 

Fst&rt 

.Fstop 

Pstart 

P’stop 

One-way 
Pre-occult ation 

8.419059 

8.419124 

45.577085 

45.586371 

Two-way 

Pre-occultation 

8.41365255 

8.41371755 

44.804735 

44.814021 

One-way 

Post-occultation 

8.419340 

8.419404 

45.617228 

45.626371 
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Fig. 1 . Block diagram of the Parkes Radio Science System configuration. 
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Fig. 2. Allan variance of the Parkes Radio Science System. 



Fig. 3. Phase-noise spectral density of the Parkes Radio Science System, test transmitter 
phase-locked to FTS, -100 to +100 Hz (center frequency— 20 kHz). 
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Fig. 4. Noise-power spectral density of the Radio Science System, test transmitter 
free-running, —10 to +10 Hz (center frequency = 19.95 kHz). 
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Fig. 5. System noise-power spectral density showing ODA-induced spurious signals, 
— 200 to +200 Hz (center frequency = 19.95 kHz). 
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Fig. 10. Power spectrum showing Dana #1 06227-generated spuri- 
ous signals, Dana frequency = 45.578013 MHz. Plot computed by 
JPL RODAN computer from tape-recorded samples. 
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Fig. 11. Power spectrum showing Dana #11 8430-generated spuri- 
ous signals, Dana frequency = 45.578013 MHz. Plot computed by 
JPL RODAN computer from tape-recorded samples. 
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The results of preliminary 32-GHz calibrations of the 70-meter antenna at 
Goldstone are presented. Measurements were done between March and July 1989 
using Virgo A and Venus as the primary efficiency calibrators. The flux densi- 
ties of these radio sources at 32 GHz are not known with high accuracy , but were 
extrapolated from calibrated data at lower frequencies. The measured value of ef- 
ficiency (0.35) agreed closely with the predicted value (0.32), and the results are 
very repeatable. Flux densities of secondary sources used in the observations were 
subsequently derived. These measurements were performed using a beamswitching 
radiometer that employed an uncooled high-electron mobility transistor (IIEMT) 
low-noise amplifier. This system was installed primarily to determine the perfor- 
mance of the antenna in its 1989 configuration, but the experience will also aid in 
successful future calibration of the Deep Space Network (DSN) antennas at this 
frequency. 


I. Introduction 

NASA/JPL is planning to use the 32-GHz frequency 
band for communications and navigation of future deep- 
space missions. Performance estimates of existing and fu- 
ture ground station capabilities are needed for both mis- 
sion planning and technology development. A research 
and development radiometer has been placed on the DSS- 
14 70-meter antenna as a precursor to the implementa- 
tion of 32-GHz systems on the 70-meter antenna network. 
The radiometer will be used to determine the current per- 
formance characteristics of the large antennas at this fre- 
quency and to determine what upgrades may be advisable 


for future systems. Our experience with the 70-meter cal- 
ibrations will also contribute to successful calibrations of 
the new 34-meter antennas, which will be built in the fu- 
ture. 

This article describes the measurements performed at 
32 GHz between March and July, 1989, on the recently 
upgraded (64-m) 70-meter antenna at Goldstone (DSS-14). 
These measurements were done on seven separate days by 
using radiometric observations of six astronomical radio 
sources: 3C274 (Virgo A), 3C273, 3C84, 3C286, P2134 + 
004, and Venus. 
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It is important to note that the flux densities of nat- 
ural radio sources are not accurately known at 32 GIIz. 
Very few attempts to calibrate radio source intensities on 
an absolute scale have been reported above 10 GHz, so that 
microwave spectra of even the most intense radio sources 
are increasingly uncertain at high frequencies. Estimates of 
the uncertainty in the absolute flux density scale at 32 GIIz 
' are typically 10 percent or more. The precision of the mea- 
surements reported in this article is sufficiently high that 
error estimates for the results are clearly dominated by the 
absolute flux error and not by random errors. 

II. Radiometer Description 

A. Description 

At 32 GHz, radiometry is complicated by tropospheric 
effects. Specifically, the system operating temperature 
varies due to clouds, moisture, and other weather phe- 
nomena. The measurement accuracy of the increase in 
system temperature due to a radio source is, therefore, sig- 
nificantly impacted for all but the best of observing con- 
ditions. There are techniques to eliminate these adverse 
effects; the most common one is to use a beam-switching 
radiometer. This radiometer is an adaptation of the one 
proposed by Dicke [1] wherein the difference in tempera- 
ture between a reference signal and the unknown signal is 
measured by the system. For our radiometer, the reference 
signal is derived from an antenna beam pointing slightly 
off-axis of the desired antenna beam. The reference is the 
cold sky, and the unknown is the source temperature plus 
the cold sky. The difference in detected power is, therefore, 
proportional to the source temperature. 

The reference signal is detected when the system is 
switched to the second feedliorn, which is laterally dis- 
placed from the Cassegrain focus of the 70-meter reflector. 
For this application, the displacement is 3.375 inches. This 
displacement produces a beam for this feed that is offset 
approximately 30 millidegrees lower in elevation than the 
beam produced by the on-axis feed (boresight) of the an- 
tenna. This represents approximately 4 beamwidths of the 
pattern defined by the on- axis feed. This technique is sim- 
ilar to that proposed by Slobin, et al. [2]. The off-axis 
reference antenna beam is used to provide an “off-source” 
reference temperature for the Dicke switched radiometer. 
For this purpose, the reference beam need not be precisely 
matched to the on-axis beam, but the cancellation of vari- 
ations in sky emission is improved if the two beams yield 
nearly identical system temperatures when each in turn is 
directed at the sky. 

Figure 1 shows a block diagram of the radiometer 
system. Figure 2 is a photograph of the system as as- 


sembled for test. The block diagram shows the two sepa- 
rate feedhorns, each of which produces an antenna beam 
when placed on the 70-meter antenna. The feedhorn that 
produces beam 1 (as designated on the block diagram) 
is placed at the focal point of the reflector system. The 
signals from each beam are alternately sampled by a cir- 
culator switch that operates between 2 and 20 Hz. The 
low-noise amplifier (LNA) consists of a high-electron mo- 
bility transistor (IIEMT) device. The system requires 
no cryogenic devices and is quite compact. The remain- 
ing portions of the system consist of follow-up amplifiers, 
down converters, detectors, and a Stanford Research SR- 
510 lock-in amplifier. 

The signal from the Dicke switch is a modulated 
square wave that is the difference between the signals in 
the two beams. The lock-in amplifier demodulates this sig- 
nal and produces the desired difference signal. The strip 
chart recorder plots this difference signal as a function of 
time. If each antenna beam is observing the same target, 
such as the sky, and if the system is perfectly balanced, a 
zero signal from the lock-in amplifier is recorded. When 
beam 1 is pointed to a radio source and beam 2 is off the 
source, the signal is proportional to the increase in system 
temperature due to the radio source. The output from a 
diode noise source is coupled into the waveguide of beam 1 
to calibrate the system. The diode calibration is routinely 
switched on several times each hour as the radio sources 
are observed. With this procedure, the increase in system 
temperature produced by each radio source is calibrated, 
and the adverse effects of gain variations in the IIEMT and 
receiver are virtually eliminated. The aperture efficiency 
of the antenna is then derived from the measured source 
temperatures and the assumed values of the source flux 
densities at 32 GIIz (see Section IV). 

The radiometer was placed in the XKR feedcone of the 
70-meter antenna at Goldstone. This feedcone contains 
other hardware, including a 22-GIIz Dicke beamswitch- 
ing radiometer and the X-band planetary radar. Figure 3 
shows the top of this feedcone, including the variety of 
feeds. The front-end portion of the radiometer, consisting 
of the feeds, waveguides, switches, amplifiers, and down- 
converter, is located with the feeds in the feedcone. The 
intermediate frequency (IF) signal is sent via coaxial cable 
from the feedcone to the pedestal room where the lock-in 
amplifier, square-law detector, recorder, and pulse genera- 
tor are located. 

B. Operating Characteristics 

The operating characteristics of the radiometer are de- 
fined by several quantities, the most important of which are 
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the stability and the operating system temperature, T op . 
These characteristics limit the sensitivity of the system and 
determine the configuration of the actual observations, i.e., 
switching rates, band widths, follow-up amplifier gain, etc. 
The individual components for this system were measured 
and their contributions to T op were determined. Table 1 
shows the individual contributions to the system temper- 
ature (see Section III for definitions). Also, the system 
temperature was measured by the calibration techniques 
discussed in the next section. The measured value and 
predicted value of T op agreed remarkably well. For these 
and all subsequent measurements, the reference plane for 
Top was at the input to the feedhorn (due to the calibration 
technique). The tolerances reported in this article have 2a 
(95 percent) confidence. The final result is that the effec- 
tive receiver temperature T e equals 391 (±8) kelvin. When 
the effects of the atmosphere and galactic background are 
included (by estimates), it is expected that T op will be 410 
(±10) kelvin. Actual measurements at DSS-14 indicate 
a T op of 415 (±7) kelvin. The next section provides the 
details of the calculation and calibration. 

The radiometer settings for our observations are given 
in Table 2. Included in this table are the time constants for 
the power integration, switching rates for the Dicke switch, 
IF bandwidth, and IIEMT gain. 

III. Calibration Techniques 

A. Definitions 

In this section, the quantities used in the calculation 
of the system temperatures and the quantities used to cal- 
ibrate the system are defined. Following sections show the 
equations used to perform the calibration and calculations. 
The definitions (all temperatures in kelvin) are as follows: 

T op = operating system temperature of radiometer 

T a = effective antenna temperature 

T e = effective receiver temperature 
T s ky — total temperature due to the sky contribution 
T atm — temperature due to the atmosphere 
T ga / = 3.0-K cosmic background radio emission 

T 80 = temperature contribution due to feed spillover 

T qs = temperature contribution due to quadrupod 
scattering 

T t \ = temperature due to the transmission line be- 
tween amplifiers and feed (including switches) 


T p = physical temperature of transmission line 
(300 K) 

T m = temperature due to the LNA 

T etc — temperature due to the follow-up equipment, 
including the downconverter and IF equipment 

Tf = temperature of T etc referenced to the input of 
the LNA, called “follow-up” temperature 

G t i = gain of the transmission line (< 1) 

L t i = loss of the transmission line = 1 /Gti 

G m — gain of the LNA 

For any measurement, the reference plane location for 
calibration must be explicitly stated since the resulting 
temperatures depend on reference location. In the cali- 
brations performed, the measurement reference plane was 
located at the aperture of the feedhorn. Therefore, this 
position was chosen for reporting the various temperatures 
and noise diode values. 

B. Calculation of System Parameters 

For the reference plane chosen in this work, the calcu- 
lation of T op was performed using the following equations: 


T op = T a + T e 

(1) 

Ta — T s ky + T 80 4- Tqs 

(2) 

T s ky — T a tm H~ T ga \ 

(3) 

rn _ rr , T m f T e tc 

i c — i/-f- + 

O'tf 

(4) 

T, = (L tl - 1 )T P 

(5) 

rp T etC 

T ‘ = gZ 

(6) 


Combining Eqs. (2) through (6) into Eq. (1) yields 


T op — |(T afm -f Tg a i ± T s0 + X^)J 

+ [(L tl -l)T p + L t ,(T m + Tj)] (7) 

Using Eq. (7) and the information in Table 1, the value of 
T op is expected to be 409.5 K, which may be compared to 
the measured value given earlier as 415 K. 
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T nd = T op (Y 4 - 1) 


(14) 


C. Aperture Load and Noise Diode Calibrations 

The calibrations to determine the system tempera- 
ture, sky temperature, effective receiver temperature, and 
noise diode temperature were performed using an aperture 
loading technique [3]. This technique uses one microwave 
absorber (a blackbody source) at ambient temperature, 
« 300 K, and another that is also soaked in a bath of 
liquid nitrogen, temperature « 77 K. The two absorbers 
were placed in turn over the aperture of the feed for suc- 
cessive measurements of power. This was done by carrying 
the absorber, a styrofoam container, and liquid nitrogen to 
the feedcone and manually holding the absorber over the 
aperture. Noise power measurements were made on each 
aperture load with the noise diode turned on and off. A 
power meter was used to measure the IF signal in total 
power mode, i.e., the radiometer was not Dicke switching. 
The measurement was done repeatedly for approximately 
5 minutes in order to minimize the time that the absorber 
was in the nitrogen bath for the cold load case. In this 
way, the effects of ice crystallization in the styrofoam and 
absorber were reduced. The actual temperature of the am- 
bient load is measured by a thermometer, whereas the cold 
load was assumed to be 77 K. Typically, eight to ten mea- 
surements of the quantities defined below were obtained in 
the 5 minute period. Before calculating the quantities of 
interest, the following symbols are defined: 


Let 


Yi = 


sky 


Power looking at hot load 
Power looking at sky 


( 8 ) 


*2 = 


sky 


Power looking at cold load 
Power looking at sky 


( 9 ) 


y 3 = 


Ph Power looking at hot load 
P c Power looking at cold load 


( 10 ) 


Y,= 


Pskyfnd Power looking at sky noise diode on 


sky 


Power looking at sky 


(ii) 


From Eqs. (8) through (11) it can be shown that the fol- 
lowing parameters result: 


T op = 


T h -T e 

Y1-Y2 


( 12 ) 


T h - Y 3 T c 
Yi 3-1 


(13) 


Pa — Pop ~~ Pe 


( 15 ) 


where Ph is the temperature of the hot load, P c is the tem- 
perature of the cold load, and P n d is the noise diode tem- 
perature at the measurement reference plane. The mea- 
sured values for these parameters are found in Table 3. 

A total of 31 noise diode calibrations were done on 
three separate days during this observation period. Using 
Eqs. (12) through (15), it was determined that the noise 
diode was quite steady regardless of the outside temper- 
ature conditions, probably because the XIvR feedcone is 
temperature controlled. The outside temperature varied 
from 14.5 degrees C to 31 degrees C, while the cone tem- 
perature range was only 18.5 degrees C to 22 degrees C. 
This is mentioned because the noise diode for this experi- 
ment was not in an environmentally controlled box or oven 
that would improve the stability of the noise diode sig- 
nal. Table 3 shows the normal standard deviation of the 
31 measurements for each of the given quantities. 

From Tables 1 and 3 it can be seen that there is ex- 
cellent agreement between the measured value of T op (Ta- 
ble 3) and the predicted value (Table 1). Also, there is 
excellent agreement between the measured value of T e (Ta- 
ble 3) and the predicted value (Table 1). The temperature 
contribution due to the atmosphere is often important. For 
each of our calibrations, the weather was clear and calm 
with low humidity. From Eq. (15) P a can be calculated, 
which from Eq. ( 2 ) allows estimates of P a tm • If Pgai I s as_ 
sumed to be 3 K, the quadrupod and spillover are assumed 
to be 6.5 K [4]. Thus, P a tm can be estimated to be 6.6 K. 

The linearity measurement of the system is a by- 
product of the noise diode and system temperature cali- 
brations. By injecting the noise diode into the signal path 
during the hot load, cold load, and sky measurements, the 
effects of errors due to system linearity may be calculated. 
This is done by comparing the noise diode value calcu- 
lated from Eq. (14) using different Y 4 factors; specifically, 
by using observations made looking at the hot and cold 
load with and without the noise diode, as is done when 
observing the sky. Observations indicated the difference 
in measured noise diode values is small enough to indicate 
a linear system. It is usually thought that if the linearity 
error is small, no correction should be performed since the 
correction is frequently based on a model that may have 
errors of the same order of magnitude as the linearity error 
itself. 
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IV. Radio Source Data 

A. Radio Sources 

Several criteria were used to select appropriate radio 
sources for the antenna performance measurements. First, 
it was very important to observe the radio galaxy Virgo A 
(3C274) so that the antenna gain measurements could be 
traced to the Virgo A microwave spectrum, which is con- 
sistently referenced in the radio astronomy literature. It is 
one of the few sources whose spectrum has been calibrated 
over a wide range of the microwave frequencies (e.g., see 
Baars et al. [5]). Although the solid angle of the Virgo A 
source is not especially small compared to the half-power 
beamwidth of the 70-m antenna operating at 32 GHz, it 
is by far the most compact of the sources that have been 
used to calibrate the absolute scale of the microwave flux 
density spectrum. 

A second source selected for gain calibration purposes 
was Venus, which is an intense radio source at short cen- 
timeter wavelengths. The planet’s dense atmosphere ex- 
hibits a thermal emission spectrum with an equivalent 
black body temperature of 475 K at 32 GHz [6]. The planet 
was located near the far side of its orbit during the observa- 
tions, and this fortuitous location meant that the angular 
diameter was near its minimum value and much smaller 
than the antenna beam. Venus also has an advantage in 
that models of the radio emission spectrum have been com- 
puted by applying radiative transfer theory to the physical 
and chemical properties of the planet’s atmosphere. The 
properties of the atmosphere on a global scale are quite 
well-known from the in situ data returned by U.S. and So- 
viet probes and orbiting spacecraft. In addition, years of 
ground-based astronomical studies have added other criti- 
cal information to the theoretical work. 

Four other sources were included. The variable radio 
sources 3C84 and 3C273 were observed because they are 
intense emitters at 32 GIIz and they have very small angu- 
lar diameters (< 0.0005 degrees « 1.8 arcseconds), which 
makes them very useful for measuring the antenna beam 
parameters. Both are known to be variable on timescales 
of weeks or months, so they can only be used for rela- 
tive antenna gain measurements. The sources 3C286 and 
P2 134+004 also were observed as a consistency check to 
see if the new flux density results agreed with extrapola- 
tions of their published spectra to 32 GIIz. 


at the time. Two others were DR21 and NCG7027, whose 
flux densities are thought to be known within reasonable 
tolerances. In fact, DR21 was observed, but the measure- 
ments appeared to be contaminated by one or more nearby 
sources. DR21 lies in a highly complex region of thermal 
emission sources in the plane of the galaxy, and the pre- 
sumption is that the angular separation of the two antenna 
beams is small enough that the reference antenna beam is 
detecting neighboring sources of emission. Additional work 
on this problem is planned. 

Table 4 lists the six sources that were used in the cal- 
ibration and summarizes the measured flux density, S\ the 
associated increase in antenna temperature at the reflector- 
set angle of 45 degrees, T S) of the four secondary sources; 
the source size correction factors, C r ; and the increase in 
antenna temperature for a 100 percent efficient antenna. 
This increase in antenna temperature is given by: 

Tl0 ° = (16) 

where k is the Boltzmann constant, k — 1.38062 x 
10 ~ 23 Ws~ 1 K~ l ] and A p is the physical aperture size of 
the antenna, A p = The efficiency is thus given by 


B. Flux Densities 

The flux densities for Virgo A and Venus were used 
for the aperture efficiency measurements. From the mi- 
crowave spectrum of Virgo A published by Baars et al. [5], 
the flux density at 32 GHz should be 14.68J y ±0.8J y (Icr) , 
where 1 J y = 1 Jansky = 10 -26 Wm" 2 Hz” 1 . The uncer- 
tainty is larger than those usually quoted at longer wave- 
lengths because the spectrum (thought to be linear) must 
be extrapolated to 32 GIIz from 25 GIIz, which marks the 
upper-frequency limit of the absolute measurements that 
have been published for this source. 

The flux density, 5, for Venus is calculated from the 
disk-averaged brightness temperature and the solid angle 
of the planet. The expression for the flux density, using the 
Rayleigh- Jeans approximation to the blackbody radiation 
equation, is 


Several other sources that would have been useful are 
not included in this article. Some were simply not available 
during the time scheduled for the experiments, and others 
were eliminated because of time constraints. One of these 
was the planet Mars, which was not favorably positioned 


_ (2 kT)Q 
b A 2 


(18) 


where A = 0.009375 m is the wavelength of the observa- 
tions, and T = 475 ± 25 K (1<t) is the brightness tempera- 
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ture of Venus at 32 GHz [6]. The solid angle of the planet, 
Q = 7 tv 2 /d 2 , was calculated with the known distance, d, 
of Venus for each observation. The effective radius, r, of 
the thermally emitting atmosphere of Venus is 6120 km 
[7]. Table 4 shows the flux density of Venus for the two 
separate days it was observed. 

C. Radio Source Size Correction Factors 

It is an unfortunate fact that radio sources bright 
enough for microwave calibration purposes are either time 
variable or they are partially resolved by the compact 
beamwidths of modern antennas operating at short cen- 
timeter wavelengths. It is possible to correct for the par- 
tial resolution of sources if their angular dimensions are 
smaller than the antenna beamwidth. Only two of the ob- 
served sources, Venus and Virgo A, have angular dimen- 
sions large enough to require correction. Venus was not a 
problem because it was so far from Earth at the time that 
the diameter of the disk was only 0.0027 degree, while the 
angular beamsize is about 0.009 degree at 32 GIIz. A cor- 
rection factor to account for the partial resolution of the 
planet’s disk by the antenna beam was calculated; the re- 
sult is 1.03 ±0.006 (2 <t). 

Unfortunately, the source structure of Virgo A is nei- 
ther compact nor very well-known at 32 GIIz. Maps of 
Virgo A have not been published at frequencies above 
23 GIIz, but relatively good estimates of frequency de- 
pendence of the source structure have been made by M. J. 
Klein. These estimates, derived from maps of the source 
made at 10 frequencies between 400 MHz and 23,000 MIIz, 
show that the Virgo A size correction for our measure- 
ments should be near C T = 1.80. While the structure of 
the source is known to become more compact at succes- 
sively higher frequencies, the source structure at 32 GIIz is 
poorly understood. For this reason, we obtained an empir- 
ical estimate of C r by scanning the antenna beam through 
the source position several times in two orthogonal planes 
(azimuth and elevation). The procedure was duplicated 
for the point source 3C273 at very similar azimuth and 
elevation angles. By comparing the response functions of 
the scans from the two sources, the value of C r can be 
estimated from the apparent beam-broadening produced 
by the scans across Virgo A, whose angular structure is 
only slightly smaller than the beam. With the information 
currently available, the best estimate of the value of C r is 
1.57 ±0.2 (2<j). 


V. Atmospheric Effects 

In order to provide antenna characteristics that are 
independent of the atmospheric attenuation, the atmo- 


spheric loss factor needs to be removed from the measure- 
ments. One technique (see [3]) to perform this correction 
uses an efficiency factor given by: 

L = L’ ec(2) (19) 

where L z is the loss factor (L and L z < 1) at the zenith 
direction, and z is the zenith angle, equal to 90 de- 
grees minus the elevation angle. For these calculations, 
L z = —0.193 dB = 0.9565, which is a typical value for 
the clear atmosphere present during the observations [3]. 
To determine the efficiency of the antenna for an ideally 
transparent atmosphere, the measured efficiency is divided 
by Eq. (19). This technique is sufficiently accurate for the 
measurements reported here. 

VI. 70-Meter Performance Measurement 

A. Observations 

After several sessions of calibrations and setup work, 
the observations were performed on May 3, 4, 6, 8, 11, 
15, and July 9, 1989 during both daylight and nighttime 
conditions. During all but one of these observations, there 
was exceptionally good weather. The sky was clear for 
most of the observations, and the effects of the high clouds 
that occasionally appeared were not detectable in the data. 
On one date, extremely high winds (« 17.8 m/sec) pre- 
cluded accurate calibrations, but useful information about 
the performance of the antenna in “near stow” wind condi- 
tions was obtained. Given the good observation conditions, 
there is high confidence in the results described below. 

B. Results 

1. Aperture efficiency. The efficiency as a func- 
tion of the elevation angle as directly measured with the 
radiometer is shown in Fig. 4. This figure shows the dis- 
crete data that were measured duiing the observations for 
all the sources as well as a second-order fit to the data set. 
Figure 5 shows the efficiency as a function of elevation for 
the antenna with the effects of the atmosphere removed. 
Also shown in this figure is a third-order fit to the data 
which clearly shows the effects of the exponential secant (z) 
in the lower elevation angles. The aperture efficiency, as 
calculated using the curve fit at the reflector set angle of 
45 degrees, is found to be 35 percent. 

For each data point on the figures, the measurement 
time is on the order of 8 minutes, which is not optimal for 
a beam-switching radiometer. This time constraint arose 
due to a known cyclic oscillation in the hour-angle track- 
ing of the 70-meter antenna that causes the beam to be 
pointed alternately ahead or behind the source by approx- 
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imately 0.004 degree, which is approximately equal to the 
3 dB beamwidth. This oscillation has a period of 4 min- 
utes; therefore, to perform an observation, one has to chart 
the data for 4 minutes to select the peak signal, and then 
run the noise diode calibrator. This operation thus re- 
quires 8 minutes. This condition is an example of one use 
of this radiometer: to determine advisable upgrades for fu- 
ture systems. The problem has been extensively studied 
by the Ground Antenna and Facilities Engineering Section, 
and solutions will soon be implemented. 

2. Beam shape as a function of elevation. 

The beam shape of the antenna was measured by per- 
forming scans in elevation and cross-elevation. The 3 dB 
beamwidth at the reflector set angle is 0.009 degree. The 
beamwidth changes as a function of the antenna elevation 
angle. Wider beamwidths are seen at the lower and higher 
elevation angles, whereas the narrowest beamwidths are 
found near the set elevation angle of 45 degrees. For the 
data shown in Figs. 4 and 5, source size correction factors 
are used that are constant as a function of the antenna 
elevation angle; however, some error is incurred because 
the antenna beamwidth changes as a function of the angle. 
This is because the correction factor is related to the an- 
tenna beamwidth. Further work must be done to quantify 
the proper function for the corrections due to finite source 
sizes; this is another reason to calibrate relative perfor- 
mance using secondary very small, but perhaps variable, 
sources. 

3. Pointing. Focus and pointing of the 70-meter an- 
tennas of the DSN are achieved by rotating the subreflector 
to position the system focus at the feed position on the fo- 
cal ring. This technique allows a multitude of feeds to be 
used on the same reflector system. However, the feed sys- 
tem of this radiometer is placed at a position where there 
is no mechanical pin stop for exact positioning of the sub- 
reflector, as there is typically. Analysis showed that the 
likely error in subreflector positioning due to the lack of 
this stop would yield a beam direction error of approxi- 
mately 0.002 degree. This guarantees that the source will 
always be within the main beam of the antenna. Further- 
more, the operational technique for positioning the subre- 
flector includes positioning it to 0 degree, setting the pin 
that exists for that position, resetting the synchro output, 
and then moving the subreflector to the 32-GHz feed po- 
sition. In this way, the subreflector always approaches the 
desired position from the same direction with the same 
velocity. For each observation, a pointing exercise is per- 
formed that locates the beam bias for the remainder of the 
observation period. Beam “peaking” is continued during 
the observation period to assure maximum signal. The 
pointing model that is used for any feed in the XKR feed- 


cone is accurate enough to ensure that 32-GHz sources al- 
ways remain within the main beam of the antenna, which is 
an excellent starting position. Unfortunately, the pointing 
data that are recorded from observation period to obser- 
vation period are usually not repeatable. 

4. Focus. The focus of the reflector changes as a 
function of elevation, as it does with pointing. However, 
the focus change is repeatable from day to day. The focus 
is defined as the subreflector location along the axis normal 
to the aperture plane of the reflector that maximizes the 
RF signal. Figure 6 shows a focus curve of signal versus 
z-axis position at 27 degrees elevation angle. 

C. Comparison to Prediction 

The measured value of peak efficiency (35 percent at 
45.5 degrees elevation) and the measured efficiency with el- 
evation function were compared to predicted values for the 
70-meter antenna. With knowledge of the surface rough- 
ness of the 70-meter antenna at Golds tone and a “long- 
wave” efficiency, i.e., efficiency for a perfectly smooth re- 
flector surface, the 32-GHz performance was predicted. 1 
This was done using Ruze’s equation for gain degradation 
due to random surface errors [8]. The expected efficiency 
at the set angle of 45 degrees elevation was 0.313. The 
measured value, which was based on assumed flux densi- 
ties, was 0.35 ± 0.5 (2 cr). The difference, 0.04, was less 
than the uncertainty of the measurements. 

VII. Error Sources 

The repeatability of the data suggests that the pre- 
cision of the measurements reported here is very high. 
The noise diode data are tightly clustered about the mean 
value; the system response to radio sources varies little 
from observation to observation on a relative scale; and 
the system temperature and sensitivity are highly repeat- 
able from day to day. In contrast, the absolute accuracy is 
much less precise. Several factors that might degrade the 
accuracy have been identified by category: radio source 
parameters, noise diode calibration, atmospheric attenua- 
tion, and instrumentation. 

As discussed above, the flux densities of the radio 
sources are known with uncertainties of approximately 10 
percent (2a). This source of error might explain why the 
aperture efficiency values used on Venus are so different 
from those derived from Virgo A. The peak efficiency (near 


1 D. A. Bathker, “DSS 14 70M Efficiencies Above X-Band Fre- 
quencies,” JPL Interoffice Memorandum No. 3328-89-0109 
(internal document), Jet Propulsion Laboratory, Pasadena, 
California, May 23, 1989. 


212 



45 degrees elevation) from Venus is about 0.38, whereas the 
corresponding value for Virgo A is only 0.32. The mean 
of the values is 0.35, which represents our best estimate of 
the peak efficiency. 

It is difficult to determine which source might be more 
reliable for calibration purposes. The flux density of Virgo 
A may be incorrect and the correction for source structure 
(CV) is poorly known. The flux density spectrum is based 
primarily on observations below 20 GHz [5]. As measure- 
ments at higher frequencies are added, the spectral slope 
of this multi-component radio source is expected to depart 
from the linear approximation that is currently in use. 

Venus, on the other hand, is expected to exhibit a 
smoothly variable, featureless spectrum at short centime- 
ter wavelengths [6], However, the temperature of the Venu- 
sian atmosphere, on an absolute scale, is not tightly con- 
strained by models calculated from Earth-based and space- 
craft data. 

Since neither source is preferable, it was assumed that 
the mean value is the best estimate of the true efficiency. 
The Venus data were multiplied by 0.92, and the Virgo A 
data by 1.08 to remove the adverse effects of the discrep- 
ancy on the data plotted in Fig. 5. 

Another contributor to error is the noise diode used 
to set the calibration scale. Errors in the calibration of 
this noise diode level may be present due to several things: 
the actual temperature of the cold load is not known (it is 
assumed to be 77 K); the effect of the liquid nitrogen on 
the system (reflection, absorption, etc.) may not be negli- 
gible at this frequency; ice may have formed in or on the 
styrofoam tub used to hold the absorber and nitrogen; and 
there are always errors in the measurement of the various 
Y-factors [Eqs. (8) through (11)]. The random effect of 
the atmosphere has been minimized using a beamswitch- 
ing technique, and the zenith attenuation was assumed, 
not measured. The system linearity is good but not per- 
fect, and the detectors and power meters also introduce 
errors. 

Finally, the attenuation of the signal due to the axial 
focus of the subreflector affects the measurements. This ef- 
fect was investigated during the observations by systemati- 
cally moving the subreflector in the axial direction (Fig. 6). 
It was found that the autofocus program that is operating 
on DSS-14 is near correct, and the signal was frequently 
maximized in order to minimize this error source. 

Tables 5 and 6 list contributors to the calibration er- 
rors for the noise diode and system efficiency, respectively. 
The amount of uncertainty for each contributor is an esti- 


mate based on typical measurements of this kind on this 
present undertaking. 

An attempt was made to minimize the error sources 
over which there was control. Diode calibrations were per- 
formed in 5 minutes or less to minimize ice formation prob- 
lems. Thirty-one diode calibrations were performed using 
different experimenters to obtain a statistical average of 
the diode level. This made it possible to quantify ran- 
dom errors. The system was fine-tuned to be as linear as 
possible. Because of these measures and because observa- 
tions were performed carefully, it is possible that the ab- 
solute error in the efficiency measurements is greater than 
10 percent (2 <r). Given the excellent agreement between 
the measured and the predicted values of efficiency for this 
antenna, it is also possible that sensible estimates of the 
flux density of the radio sources have been obtained, at 
least for preliminary performance estimating purposes. 

VIII. Concluding Remarks 

A series of radiometer calibrations and observations on 
the 70-meter antenna at Goldstone have been performed 
in order to characterize its performance at 32 GHz. These 
observations were done using a beamswitching radiometer 
with an uncooled HEMT, placed in the XKR feedcone. 
The calibrations show highly repeatable results indicat- 
ing a high precision. However, several error sources that 
will affect the absolute accuracy of the measurements are 
known. It is calculated that this accuracy is on the order 
of 10 percent, which in any event is probably less than the 
absolute accuracy of the natural source flux densities. 

In order to perform calibrations of higher accuracy, 
a program is being developed between Caltech and JPL 
to perform observations of 100 or more radio sources at 
the Owens Valley Radio Observatory. The combination of 
a wideband cooled IIEMT amplifier on a high-efficiency 
(1.5-meter clear aperture) gain standard reflector antenna 
to measure the very strongest sources and a larger 5-meter 
antenna to measure numerous weaker sources should al- 
low for significantly increased confidence in natural radio 
source flux densities at 32 GHz. In particular, the ap- 
parent discrepancy between the aperture efficiency values 
based on Venus measurements and those based on Virgo 
A will be investigated. 

The gain standard reflector has demonstrated good 
agreement between theoretical and measured estimates 
and will be very accurately calibrated with National In- 
stitute of Standards and Technology traceability. This an- 
tenna will be used to accurately calibrate the gain of the 
5-meter antenna and thus calibrate the flux density of the 
100 or more sources. 
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Table 1. System temperature contributions for 32-GHz radiometer 3 


Item 

Loss, b dB 

Temperature, K 
Contribution 

Temperature, K 
at Reference Plane 

Comments 

1. Waveguide 

-0.75 

56.55 

56.55 

Measured 

2. LNA 

+50.0 

281.5 

334.56 

Measured 

3. T a tm 

— 

— 

7.87 

Estimated[3] 

4. T 'qs + T so 

— 

— 

6.5 

Estimated [4] 

5. T ga i 

— 

— 

3.0 

Assumed 

6. Tj 

— 

0.84 

1.0 

Estimated 

7. T a 

— 

— 

17.37 

Lines3 + 4 + 5 

00 

— 

— 

392.11 

Lines 1 + 2 + 6 

9. Top 

— 

— 

409.48 

Lines 7 + 8 


a Reference plane at the aperture of the feed horn. 
b Positive value means gain; negative value means attenuation. 


Table 2. 32-GHz radiometer operating settings 


Item 

Setting 

Comments 

Dicke switching rate 

20 Hz 

Square- wave rate 

Post-detection time constant 

1 sec 

On lock-in amplifier 

Pre- detection time constant 

30 msec 

On lock-in amplifier 

Lock-in amp phase for peak 

— 23 degrees 

On lock-in amplifier 

IF bandwidth 

32 MHz 

IF filter 

IF center frequency 

64 MHz 

IF filter 

Level set 

10.5 dB 

Square-law detector 


Table 3. 32-GHz system temperature measurements at DSS-14 


Item 

Temperature, K 

Standard Deviation, K 

T n d 

10.3 

0.2 

T a 

16.1 

0.5 

T c 

399.6 

13.5 

T 0 p 

415.7 

13.8 
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Table 4. 32-GHz calibration sources 


IAU 

Other names 

Measured T s 


Flux density S, 

^100 percent’ ^ 

Designation 

(el = 45 degrees), K 

O r 

Measured b 

Assumed 

1228 + 126 

Virgo A, 3C274 

4.39 

1.57 

*** 

14.68 

13.03 

* * * 

Venus (4 May 89) 

14.13 

1.03 

*** 

26.70 

36.13 

*** 

Venus (8 May 89) 

14.48 

1.03 

*** 

26.90 

36.40 

0316 + 412 

3C84 

21.11 

1.00 

43.71 

*** 

60.92 

1226 — 023 

3C273 

14.27 

1.00 

29.54 

*** 

41.17 

1328 +307 

3C286 

0.98 

1.00 

1.94 

*** 

2.70 

2134 +004 

Parks 2134 + 004 

1.87 

1.00 

3.87 

*** 

5.39 

a Jy = Jansky = 
b Epoch 1989.4 

= 10 - 26 Wm- 2 Hz- 1 







Table 5. Error contributors to the noise diode calibration 
(all errors are estimated with confidence of 2a) 


Error Contributor 

Error Amount 

A Tnd 

A Th 

±1 K 

±0.05 K 

A Tc 

±1 K 

±0.05 K 

av 4 

±0.1 percent 

±0.45 K 

LN2 mismatch 

±0.1 percent 

±0.45 K 

Linearity 

±2 percent 

±0.05 K 

RSS error 


±0.64 K 


Table 6. Error contributors to efficiency (ail errors are estimated with confidence of 2a) 


Error Amount A r) 


Error contributor 

Venus 

Virgo A 

Venus, 

percent 

Virgo A, 
percent 

Absolute flux 

±11 percent 

±12 percent 

±4.4 

±3.7 

cv 

±0.6 percent 

±7 percent 

±0.2 

±2.5 

Source Measurement 

±0.45 Jy 

±0.61 Jy 

±0.7 

±1.5 

T n d 

±0.64 K 

±0.64 K 

±2.0 

±2.0 

T a tm 

±2 percent 

±2 percent 

±1.0 

±1.0 

Z- focus 

±1 inch 

±1 inch 

±0.8 

±0.8 

RSS error 



±5.05 

±5.28 



BEAM 1 


BEAM 2 



STRIP CHART 
RECORDER 


Fig. 1. Block diagram of the 32-GHz beamswitching radiometer. 
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Fig. 2. Dlcke radiometer front end. 
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F)g. 4. DSS-14: 32-GHz efficiency Including the atmospheric 
effects (includes second- order fit). 
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Fig. 5. DSS-14: 32-GHz efficiency; atmospheric effects removed 
from the data (Includes third-order curve fit). 


Fig. 3. Top view of XKR feedcone showing various feed systems. 
(32-GHz radiometer is second from left.) 
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Errata 

P. H. Richter and S. D. Slobin (Telecommunications Systems Section) have sub- 
mitted the following errata to their article “DSN 70-Meter Antenna X- and S-Band 
Calibration Part I: Gain Measurements,” that appeared in the Telecommunications 
and Data Acquisition Progress Report ^2-97 ) Vol. January-March 1989, June 15, 
1989: 

As the result of a somewhat too severe round-off, the coefficients appearing 
at the top of page 319 do not reproduce the X-band flux density value of Sq = 
45.2037 Jy for the radio calibration source Virgo A (3C274) given in the article. 

The values given 

a 0 ~ 3.9964 

a x =0.1733 


a 2 =-0.3341 


&3 = 0.0352 
should be changed to 
ao = 3.9964 
a x = 0.17327 


a 2 = —0.33415 


a 3 = 0.035172 

The X-band Virgo A flux density given on page 319 should be rounded to 
45.20 Jy and the S-band value of 138.48 Jy listed on page 323 should be reduced to 
the value 137.93 Jy. 

These changes result in less than 0.001 dB of gain increase at X-band and a 
0.017-dB gain increase at S-band. 
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